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HUMAN SOCS PROTEINS 

TECHNICAL FIELD 

5 This invention relates to nucleic acid and amino acid sequences of human SOCS 

proteins and to the use of these sequences in the diagnosis, treatment, and prevention of 
cancer, immune and neurological disorders, and infectious diseases. 

BACKGROUND OF THE INVENTION 

Signal transduction is a general process in which cells respond to extracellular 
signals (hormones, neurotransmitters, growth and differentiation factors, etc.) through a 
cascade of biochemical reactions beginning with the binding of the signal molecule to a 
cell membrane receptor and ending with an effect on an intracellular target molecule. 

15 Intermediate steps in this process involve the activation of various cytoplasmic proteins by 
phosphorylation via protein kinases and the translocation of some of these activated 
proteins to the cell nucleus, where the transcription of specific genes is affected. The 
signal transduction process regulates all types of cell functions, including cell 
proliferation, differentiation, and gene transcription. 

20 Cytokines are a specific class of extracellular signaUng molecules that control 

growth, differentiation, and various functions of hemopoietic and immune cells. 
Cytokines include interleukins (ILs), colony-stimulating factors (G-CSF and GM-CFS), 
erythropoietin (EPO), and various growth factors (EGF, PDGF, TGF, and FGF; Callard, 
R. and Gearing, A. (1994) The Cytokine Facts Book , pp 2-6, Academic Press, San Diego, 

25 CA). 

Many of the cytokine receptors, including those for the growth factors EGF, 
PDGF, and FGF exhibit intrinsic protein kinase activity. Binding of the cytokine to its 
receptor triggers the autophosphoryiation of a tyrosine residue on the receptor. It is 
believed that these phosphoryiated residues are recognition sites for the binding of other 
30 cytoplasmic signaling proteins which link the initial receptor activation at the cell surface 
to the activation of a specific intracellular target molecule. These signaling proteins 
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contain an src homology 2 (SH2) domain that is a recognition and binding site for the 
phosphotyrosine residue. SH2 domains are found in a variety of signaling molecules and 

oncogenic proteins, such as phospholipase C-y, Ras GTP-ase activating protein, and GRB2 
(Lowenstein, EJ. et al. (1992) Cell 70:431-442). 

5 While much is known about key events in the activation of signaling pathways, 

less is known about how they are switched off Severeal SH2-containing proteins have 
been identified that are induced in murine lymphoid cells by various cytokines, including 
IL-2, IL-3, IL-6, Interferon-y, and EPO (Yoshimura, A. et al. (1995) EMBO Journal 
14:2816-2826; Starr. R. et aL (1997) Nature 387: 917-921; and Naka, T. et al. (1997) 

10 Nature 387: 924-929), A common property of these proteins is the ability to suppress 
growth and differentiation in murine cells. The induction of these SH2-containing 
proteins in cytokine stimulated cells suggests that they may function as negative regulators 
of cytokine signaling. Transcription of the genes encoding four of these proteins, CIS 
(cytokine-inducible SH2-containing protein), and SOCS-1, -2, and -3 (suppressor of 

15 cytokine signaling), is induced by IL-6 both in vitro and in vivo (Starr et aL, supra) . 

The four proteins share little sequence homology in their N-terminal regions, but 
all contain a central SH2 domain and a conserved C-terminal region designated the "SOCS 
box". The function of the SOCS box is unknown. However, a conserved core triplet 
sequence (K/R) (D/E) (Y/F) within the SOCS box is similar to the tyrosine 

20 phosphorylation site recognized by the JAK kinase family. This similarity suggests that 
the SOCS box may provide a site for interaction with, and inhibition of, JAK kinases. The 
finding that SOCS-1 interacts with the catalytic region of JAK kinases supports this 
hypothesis (Endo, T. A. et al. (1997) Nature 387: 921-24). Constitutive expression of 
SOCS-1 in Ml murine lymphoid cells also inhibhs the phosphorylation of certain cell 

25 signaling components (gpl30 and Stat3) in response to IL-6 (Starr et aL, supra ). CIS 
binds to tyrosine-phosphorylated residues in the beta-chain of the IL-3 and EPO receptors 
and provides another possible mechanism for suppressing cell signaling by preventing the 
binding of other signaling proteins (Yoshimura et aL, supra). 

Recently, sixteen additional proteins have been identified containing the SOCS box 

30 domain (Hilton, D.J. et aL (1998) Proc. Natl. Acad. Sci. USA 95:1 14-1 19). Like the SH2- 
containing proteins described above, each of the proteins contains a C-terminal SOCS box 
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and a distinctive motif N-terminal of the SOCS box. In addition to four new SOCS 
proteins containing the SH2 domain, three additional classes of SOCS proteins were 
found containing WD-40 repeats (WSB-1 and -2), SPRY domains (SSB-l to -3), or 
ankyrin repeats (ASB-1 to -3). A class of small GTPases (Rar proteins) that contain the 

5 SOCS box were also identified. The function of WSB, SSB, and ASB proteins are as yet 
unknown. However, like SH2 domains. WD-40 repeats, ankyrin repeats, and SPRY 
domains have been implicated in protein-protein interactions (Hilton et al. supra). 

Defects or alterations in the activity of signaling proteins such as CIS may play a 
role in the development of various proliferative disorders and diseases such as cancer. 

10 Loss or rearrangement of the putative human gene encoding CIS is associated with the 
development of renal cell carcinomas and lung cancer (Yoshimura et al., supra). This 
association suggests that CIS may function as a tumor suppressor gene. 

The discovery of new human SOCS proteins and the polynucleotides encoding 
them satisfies a need in the art by providing new compositions which are useful in the 

15 diagnosis, prevention, and treatment of cancer, immune and neurological disorders, and 
infectious diseases. 



20 SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human SOCS proteins, 
referred to collectively as "HSCOP" and individually as "HSOCP-1", "HSOCP-2", and 
"HSOCP-3", HSOCP-4", HSOCP-5", HS0CP-6'\ HSOCP-7", HSOCP-8", and HSOCP- 
25 9". In one aspect, the invention provides a substantially purified polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-9, and 
fragments thereof. 

The invention further provides a substantially purified variant having at least 90% 
amino acid identity to at least one of the amino acid sequences selected from the group 
30 consisting of SEQ ID NO: 1 -9, and fragments thereof The invention also provides an 
isolated and purified polynucleotide encoding the polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-9, and fragments thereof 
The invention also includes an isolated and purified polynucleotide variant having at least 
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90% polynucleotide sequence identity to the polynucleotide encoding the polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 1-9, 
and fragments thereof. 

Additionally, the invention provides an isolated and purified polynucleotide which 

5 hybridizes under stringent conditions to the polynucleotide encoding the polypeptide 

comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 1*9, 
and fragments thereof. The invention also provides an isolated and purified 
polynucleotide having a sequence which is complementary to the polynucleotide encoding 
the polypeptide comprising the amino acid sequence selected from the group consisting of 

10 SEQ ID NO: 1 -9. and fragments thereof. 

The invention also provides a method for detecting a polynucleotide in a sample 
containing nucleic acids, the method comprising the steps of (a) hybridizing the 
complement of the polynucleotide encoding the polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-9, and fragments thereof to 

15 at least one of the polynucleotides of the sample, thereby forming a hybridization 
complex; and (b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide in the sample. In 
one aspect, the method further comprises amplifying the polynucleotide prior to 
hybridization, 

20 The invention also provides an isolated and purified polynucleotide comprising a 

polynucleotide sequence selected from the group consisting of SEQ ID NO: 10-1 8, and 
fragments thereof. The invention further provides an isolated and purified polynucleotide 
variant having at least 90% polynucleotide sequence identity to the polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 10- 18, and fragments thereof. 

25 The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 10-1 8, and fragments thereof. 

The invention ftirther provides an expression vector containing at least a fragment 
of the polynucleotide encoding the polypeptide comprising an amino acid sequence 

30 selected from the group consisting of SEQ ID NO: 1 -9, and fragments thereof. In another 
aspect, the expression vector is contained within a host cell. 
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The invention also provides a method for producing a polypeptide, the method 
comprising the steps of: (a) culturing the host cell containing an expression vector 
containing at least a fragment of a polynucleotide under conditions suitable for the 
expression of the polypeptide; and (b) recovering the polypeptide from the host cell 
culture. 

The invention also provides a pharmaceutical composition comprising a 
substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -9, and fragments thereof, in conjunction with a suitable 
pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide 
selected from the group consisting of SEQ ID NO: 1-9, and fragments thereof. The 
invention also provides a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder 
associated with decreased expression or activity of HSCOP, the method comprising 
administering to a subject in need of such treatment an effective amount of a 
pharmaceutical composition comprising a substantially purified polypeptide having the 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-9, and firagments 
thereof, in conjunction with a suitable pharmaceutical carrier. 

The invention also provides a method for treating or preventing a disorder 
associated with increased expression or activity of HSCOP, the method comprising 
administering to a subject in need of such treatment an effective amount of an antagonist 
of a polypeptide having an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-9, and fi'agments thereof 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ 
ID NOs), clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments 
used to assemble fiiU-length sequences encoding HSCOP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, 
homologous sequences, and methods and algorithms used for identification of HSCOP. 

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence 
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as determined by northern analysis, diseases, disorders, or conditions associated with these 
tissues, and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which 
cDNA clones encoding HSCOP were isolated. 
5 Table 5 shows the tools, programs, and algorithms used to analyze HSCOP, along 

with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
10 understood that this invention is not limited to the particular machines, materials and 
methods described, as these may vary. It is also to be understood that the terminology 
used herein is for the purpose of describing particular embodiments only, and is not 
intended to limit the scope of the present invention which will be limited only by the 
appended claims. 

15 It must be noted that as used herein and in the appended claims, the singular forms 

*'a," '*an," and '"the" include plural reference unless the context clearly dictates otherwise. 
Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a 
reference to "an antibody" is a reference to one or more antibodies and equivalents thereof 
known to those skilled in the art, and so forth. 

20 Unless defined otherwise, all technical and scientific terms used herein have the 

same meanings as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although any machines, materials, and methods similar or equivalent 
to those described herein can be used to practice or test the present invention, the preferred 
machines, materials and methods are now described. All publications mentioned herein 

25 are cited for the purpose of describing and disclosing the cell lines, protocols, reagents and 
vectors which are reported in the publications and which might be used in connection with 
the invention. Nothing herein is to be construed as an admission that the invention is not 
entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

30 "HSCOP" refers to the amino acid sequences of substantially purified HSCOP 

obtained from any species, particularly a mammalian species, including bovine, ovine. 
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porcine, murine, equine, and preferably the human species, from any source, whether 
natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which, when bound to HSCOP, increases 
or prolongs the duration of the effect of HSCOP. Agonists may include proteins, nucleic 
acids, carbohydrates, or any other molecules which bind to and modulate the effect of 
HSCOP. 

An "allelic variant'' is an alternative form of the gene encoding HSCOP. Allelic 
variants may result from at least one mutation in the nucleic acid sequence and may result 
in altered mRNAs or in polypeptides whose structure or function may or may not be 
altered- Any given natural or recombinant gene may have none, one, or many allelic 
forms. Common mutational changes which give rise to allelic variants are generally 
ascribed to natural deletions, additions, or substitutions of nucleotides. Each of these 
types of changes may occur alone, or in combination with the others, one or more times in 
a given sequence. 

"Altered" nucleic acid sequences encoding HSCOP include those sequences with 
deletions, insertions, or substitutions of different nucleotides, resulting in a polynucleotide 
the same as HSCOP or a polypeptide with at least one functional characteristic of HSCOP. 
Included within this definition are polymorphisms which may or may not be readily 
detectable using a particular oligonucleotide probe of the polynucleotide encoding 
HSCOP, and improper or unexpected hybridization to allelic variants, with a locus other 
than the normal chromosomal locus for the polynucleotide sequence encoding HSCOP. 
The encoded protein may also be "altered,*' and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a 
functionally equivalent HSCOP. Deliberate amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues, as long as the biological or immunological activity of 
HSCOP is retained. For example, negatively charged amino acids may include aspartic 
acid and glutamic acid, positively charged amino acids may include lysine and arginine, 
and amino acids with uncharged polar head groups having similar hydrophilicity values 
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and 
glutamine; serine and threonine; and phenylalanine and tyrosine. 
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The terms ''amino acid'' or ^*amino acid sequence*' refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring 
or synthetic molecules. In this context, '^fragments/' "immunogenic fragments/' or 
'"antigenic fragments'' refer to fragments of HSCOP which are preferably at least 5 to 
about 15 amino acids in length, most preferably at least 14 amino acids, and which retain 
some biological activity or immunological activity of HSCOP. Where "amino acid 
sequence'' is recited to refer to an amino acid sequence of a naturally occurring protein 
molecule, ''amino acid sequence'' and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

Amplification" relates to the production of additional copies of a nucleic acid 
sequence. Amplification is generally carried out using polymerase chain reaction (PGR) 
technologies well known in the art. 

The term "antagonist" refers to a molecule which, when bound to HSCOP, 
decreases the amount or the duration of the effect of the biological or immunological 
activity of HSCOP. Antagonists may include proteins, nucleic acids, carbohydrates, 
antibodies, or any other molecules which decrease the effect of HSCOP. 

The term "antibody" refers to intact molecules as well as to fragments thereof, such 
as Fab, F(ab')2, and Fv fragments, which are capable of binding the epitopic determinant. 
Antibodies that bind HSCOP polypeptides can be prepared using intact polypeptides or 
using fragments containing small peptides of interest as the immunizing antigen. The 
polypeptide or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) 
can be derived from the translation of RNA, or synthesized chemically, and can be 
conjugated to a carrier protein if desired. Commonly used carriers that are chemically 
coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet 
hemocyanin (KLH). The coupled peptide is then used to immunize the animal 

The term "antigenic determinanf refers to that fragment of a molecule (i.e., an 
epitope) that makes contact with a particular antibody. When a protein or a fragment of a 
protein is used to immunize a host animal, numerous regions of the protein may induce the 
production of antibodies which bind specifically to antigenic determinants (given regions 
or three-dimensional structures on the protein). An antigenic determinant may compete 
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with the intact antigen (i.e., the immunogen used to elicit the immune response) for 
binding to an antibody. 

The term "antisense^' refers to any composition containing a nucleic acid sequence 
which is complementary to the "sense'' strand of a specific nucleic acid sequence. 

5 Antisense molecules may be produced by any method including synthesis or transcription. 
Once introduced into a cell, the complementary nucleotides combine with natural 
sequences produced by the cell to form duplexes and to block either transcription or 
translation. The designation '"negative" can refer to the antisense strand, and the 
designation '"positive" can refer to the sense strand. 

10 The term "biologically active/' refers to a protein having structuraL regulatory, or 

biochemical functions of a naturally occurring molecule. Likewise, "immunologically 
active'' refers to the capability of the natural, recombinant, or synthetic HSCOP, or of any 
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells 
and to bind with specific antibodies. 

15 The terms "complementary'' or "complementarity" refer to the natural binding of 

polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the 
complementary sequence "3' T-C-A 5\" Complementarity between two single-stranded 
molecules may be "partial," such that only some of the nucleic acids bind, or it may be 
"complete," such that total complementarity exists between the single stranded molecules. 

20 The degree of complementarity between nucleic acid strands has significant effects on the 
efficiency and strength of the hybridization between the nucleic acid strands. This is of 
particular importance in amplification reactions, which depend upon binding between 
nucleic acids strands, and in the design and use of peptide nucleic acid (PNA) molecules. 
A "composition comprising a given polynucleotide sequence" or a "composition 

25 comprising a given amino acid sequence" refer broadly to any composition containing the 
given polynucleotide or amino acid sequence. The composition may comprise a dry 
formulation or an aqueous solution. Compositions comprising polynucleotide sequences 
encoding HSCOP or fragments of HSCOP may be employed as hybridization probes. The 
probes may be stored in freeze-dried form and may be associated with a stabilizing agent 

30 such as a carbohydrate. In hybridizations, the probe may be deployed in an aqueous 

solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and 
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other components (e.g., Denhardfs solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence'Vefers to a nucleic acid sequence which has been 
resequenced to resolve uncalled bases, extended using XL-PCR kit (Perkin-Elmer, 
Norwalk CT) in the 5' and/or the 3' direction, and resequenced. or which has been 
5 assembled from the overlapping sequences of more than one Incyte Clone using a 
computer program for fragment assembly, such as the GELVIEW Fragment Assembly 
system (GCG, Madison WI). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

The term ^^correlates with expression of a polynucleotide" indicates that the 
1 0 detection of the presence of nucleic acids, the same or related to a nucleic acid sequence 
encoding HSCOP, by northern analysis is indicative of the presence of nucleic acids 
encoding HSCOP in a sample, and thereby correlates with expression of the transcript 
from the polynucleotide encoding HSCOP. 

A ''deletion'' refers to a change in the amino acid or nucleotide sequence that 
15 results in the absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide 
sequence, or a polynucleotide sequence. Chemical modifications of a polynucleotide 
sequence can include, for example, replacement of hydrogen by an alkyl, acyl, or amino 
group. A derivative polynucleotide encodes a polypeptide which retains at least one 
20 biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one 
biological or immunological function of the polypeptide from which it was derived. 

The term "similarity" refers to a degree of complementarity. There may be partial 
similarity or complete similarity. The word "identity" may substitute for the word 
25 "similarity." A partially complementary sequence that at least partially inhibits an 

identical sequence from hybridizing to a target nucleic acid is referred to as "substantially 
similar." The inhibition of hybridization of the completely complementary sequence to 
the target sequence may be examined using a hybridization assay (Southern or northern 
blot, solution hybridization, and the like) under conditions of reduced stringency. A 
30 substantially similar sequence or hybridization probe will compete for and inhibit the 
binding of a completely similar (identical) sequence to the target sequence under 
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conditions of reduced stringency. This is not to say that conditions of reduced stringency 
are such that non-specific binding is permitted, as reduced stringency conditions require 
that the binding of two sequences to one another be a specific (i.e.. a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence 

5 which lacks even a partial degree of complementarity (e.g., less than about 30% similarity 
or identity). In the absence of non-specific binding, the substantially similar sequence or 
probe will not hybridize to the second non-complementary target sequence. 

The phrases ''percent identity'' and identity'' refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. 

10 Percent identity can be determined electronically, e.g., by using the MEG ALIGN program 
(DNASTAR, Madison WI) which creates alignments between two or more sequences 
according to methods selected by the user, e.g., the clustal method. (See, e.g., Higgins, 
D.G. and P.M. Sharp (1988) Gene 73:237-244.) The clustal algorithm groups sequences 
into clusters by examining the distances between all pairs. The clusters are aligned 

15 pairwise and then in groups. The percentage similarity between two amino acid 
sequences, e.g., sequence A and sequence B, is calculated by dividing the length of 
sequence A, minus the number of gap residues in sequence A, minus the number of gap 
residues in sequence B, into the sum of the residue matches between sequence A and 
sequence B, times one hundred. Gaps of low or of no similarity between the two amino 

20 acid sequences are not included in determining percentage similarity. Percent identity 

between nucleic acid sequences can also be counted or calculated by other methods known 
in the art, e.g., the Jotun Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 
1 83:626-645.) Identity between sequences can also be determined by other methods 
knovm in the art, e.g., by varying hybridization conditions. 

25 "Human artificial chromosomes" (HACs) are linear microchromosomes which 

may contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the 
elements required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino 
acid sequence in the non-antigen binding regions has been altered so that the antibody 

30 more closely resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to any process by which a strand of nucleic acid binds with 



-11- 



wo 99/61614 



PCT/US99/11497 



a complementary strand through base pairing. 

The term ''hybridization complex" refers to a complex formed between two nucleic 
acid sequences by virtue of the formation of hydrogen bonds between complementary 
bases, A hybridization complex may be formed in solution (e.g., Qt or Rot analysis) or 
5 formed between one nucleic acid sequence present in solution and another nucleic acid 
sequence immobilized on a solid support (e.g., paper, membranes, filters, chips, pins or 
glass slides, or any other appropriate substrate to which cells or their nucleic acids have 
been fixed). 

The words "insertion" or "addition"' refer to changes in an amino acid or nucleotide 
10 sequence resulting in the addition of one or more amino acid residues or nucleotides, 
respectively, to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, 
immune disorders, or infectious or genetic disease, etc. These conditions can be 
characterized by expression of various factors, e.g., cytokines, chemokines, and other 
1 5 signaling molecules, which may affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a 
substrate. 

The terms "element" or "array element" in a microarray context, refer to 
hybridizable polynucleotides arranged on the surface of a substrate. 

20 The term "modulate" refers to a change in the activity of HSCOP. For example, 

modulation may cause an increase or a decrease in protein activity, binding characteristics, 
or any other biological, functional, or immvmological properties of HSCOP. 

The phrases "nucleic acid" or "nucleic acid sequence" refer to a nucleotide, 
oligonucleotide, polynucleotide, or any firagment thereof. These phrases also refer to DN A 

25 or RNA of genomic or synthetic origin which may be single-stranded or double-stranded 
and may represent the sense or the antisense strand, to peptide nucleic acid (PN A), or to 
any DNA-like or RNA-like material. In this context, "fragments" refers to those nucleic 
acid sequences which, when translated, would produce polypeptides retaining some 
functional characteristic, e.g., antigenicity, or structural domain characteristic, e.g., ATP- 

30 binding site, of the full-length polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related 
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nucleic acid sequences. A promoter is operably associated or operably linked with a 
coding sequence if the promoter controls the translation of the encoded polypeptide. 
While operably associated or operably linked nucleic acid sequences can be contiguous 
and in the same reading frame, certain genetic elements, e.g., repressor genes, are not 

5 contiguously linked to the sequence encoding the polypeptide but still bind to operator 
sequences that control expression of the polypeptide. 

The term ''oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably 
about 20 to 25 nucleotides, which can be used in PGR amplification or in a hybridization 

1 0 assay or microarray. ''Oligonucleotide" is substantially equivalent to the terms 

"amplimer," "primerj* "oligomen" and ''probe." as these terms are commonly defined in 
the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent 
which comprises an oligonucleotide of at least about 5 nucleotides in length linked to a 
15 peptide backbone of amino acid residues ending in lysine. The terminal lysine confers 
solubility to the composition. PNAs preferentially bind complementary single stranded 
DNA or RN A and stop transcript elongation, and may be pegylated to extend their lifespan 
in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing 
20 nucleic acids encoding HSCOP, or fragments thereof, or HSCOP itself, may comprise a 
bodily fluid; an extract from a cell, chromosome, organelle, or membrane isolated from a 
cell; a cell; genomic DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a 
tissue print; etc. 

The terms ''specific binding" or ''specifically binding" refer to that interaction 
25 between a protein or peptide and an agonist an antibody, or an antagonist. The interaction 
is dependent upon the presence of a particular structure of the protein, e.g., the antigenic 
determinant or epitope, recognized by the binding molecule. For example, if an antibody 
is specific for epitope "A," the presence of a polypeptide containing the epitope A, or the 
presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
30 reduce the amount of labeled A that binds to the antibody. 

The term "stringent conditions'' refers to conditions which permit hybridization 
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between polynucleotides and the claimed polynucleotides. Stringent conditions can be 
defined by salt concentration, the concentration of organic solvent, e.g., formamide, 
temperature, and other conditions well known in the art. In particular, stringency can be 
increased by reducing the concentration of salt, increasing the concentration of formamide, 
5 or raising the hybridization temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences 
that are removed from their natural environment and are isolated or separated, and are at 
least about 60% free, preferably about 75% free, and most preferably about 90% free from 
other components with which they are naturally associated. 
10 A "substitution" refers to the replacement of one or more amino acids or 

IP nucleotides by different amino acids or nucleotides, respectively. 

J^l "Substrate" refers to any suitable rigid or semi-rigid support including membranes, 

Q filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, 

ry polymers, microparticles and capillaries. The substrate can have a variety of surface 

JJt 1 5 forms, such as wells, trenches, pins, channels and pores, to which polynucleotides or 

; polypeptides are bound. 

=^ J "Transformation" describes a process by which exogenous DNA enters and 

J2 changes a recipient cell. Transformation may occur under natural or artificial conditions 

O according to various methods well known in the art, and may rely on any known method 

20 for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host 
cell. The method for transformation is selected based on the type of host cell being 
transformed and may include, but is not limited to, viral infection, electroporation, heat 
shock, lipofection, and particle bombardment. The term "transformed" cells includes 
stably transformed cells in which the inserted DNA is capable of replication either as an 
25 autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "variant" of HSCOP polypeptides refers to an amino acid sequence that is 
altered by one or more amino acid residues. The variant may have "conservative" 
changes, wherein a substituted amino acid has similar structural or chemical properties 
30 (e.g., replacement of leucine with isoleucine). More rarely, a variant may have 

"nonconservative" changes (e.g., replacement of glycine with tryptophan). Analogous 
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minor variations may also include amino acid deletions or insertions, or both. Guidance in 
determining which amino acid residues may be substituted, inserted, or deleted without 
abolishing biological or immunological activity may be found using computer programs 
well known in the art, for example, LASERGENE software (DNASTAR). 

5 The term "variant," when used in the context of a polynucleotide sequence, may 

encompass a polynucleotide sequence related to HSCOP. This definition may also 
include, for example, "allelic"' (as defined above), "splice/' "species.'' or "polymorphic" 
variants. A splice variant may have significant identity to a reference molecule, but will 
generally have a greater or lesser number of polynucleotides due to alternate splicing of 

10 exons during mRNA processing. The corresponding polypeptide may possess additional 
functional domains or an absence of domains. Species variants are polynucleotide 
sequences that vary from one species to another. The resulting polypeptides generally will 
have significant amino acid identity relative to each other. A polymorphic variant is a 
variation in the polynucleotide sequence of a particular gene between individuals of a 

15 given species. Polymorphic variants also may encompass "single nucleotide 

polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state^ 
or a propensity for a disease state. 

20 THE INVENTION 

The invention is based on the discovery of new human SOCS proteins (HSCOP), 
the polynucleotides encoding HSCOP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cancer, immune and neurological disorders, and infectious 
diseases. 

25 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences 

encoding HSCOP. Columns 1 and 2 show the sequence identification numbers (SEQ ID 
NOs) of the polypeptide and nucleotide sequences, respectively. Column 3 shows the 
clone IDs of the Incyte clones in which nucleic acids encoding each HSCOP were 
identified, and column 4 shows the cDNA libraries from which these clones were isolated. 

30 Column 5 shows Incyte clones and their corresponding cDNA libraries. Clones for which 
cDNA libraries are not indicated were derived from pooled cDNA libraries. The clones in 
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column 5 were used to assemble the consensus nucleotide sequence of each HSCOP and 
are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the 
invention: column 1 references the SEQ ID NO: column 2 shows the number of amino 

5 acid residues in each polypeptide; column 3, potential phosphorylation sites; column 4, 
potential glycosylation sites: column 5, the amino acid residues comprising signature 
sequences and motifs; column 6, homologous sequences; and column 7, analytical 
methods used to identify each protein through sequence homology and protein motifs. 
The columns of Table 3 show the tissue-specificity and diseases, disorders, or 

10 conditions associated with nucleotide sequences encoding HSCOP. The first column of 
Table 3 lists the nucleotide SEQ ID NOs. Column 2 lists tissue categories which express 
HSCOP as a fraction of total tissue categories expressing HSCOP. Column 3 lists 
diseases, disorders, or conditions associated with those tissues expressing HSCOP. 
Column 4 lists the vectors used to subclone the cDNA library. 

] 5 The columns of Table 4 show descriptions of the tissues used to construct the 

cDNA libraries from which cDNA clones encoding HSCOP were isolated. Column 1 
references the nucleotide SEQ ID NOs, column 2 shows the cDNA libraries from which 
these clones were isolated, and column 3 shows the tissue origins and other descriptive 
information relevant to the cDNA libraries in column 2. 

20 The following fragments of the nucleotide sequences encoding HSCOP are useful, 

for example, in hybridization or amplification technologies to identify SEQ ID NO: 10- 18 
and to distinguish between SEQ ID NO: 10-1 8 and related polynucleotide sequences. 
Useful fragments include the fragment of SEQ ID NO: 10 from about nucleotide 243 to 
about nucleotide 317; the fragment of SEQ ID N0:1 1 from about nucleotide 834 to about 

25 nucleotide 911; the fi*agment of SEQ ID NO: 1 2 from about nucleotide 233 to about 
nucleotide 298; the fragment of SEQ ID NO: 13 from about nucleotide 242 to about 
nucleotide 307; the fragment of SEQ ID NO: 14 from about nucleotide 447 to about 
nucleotide 524; the fragment of SEQ ID NO: 1 5 from about nucleotide 287 to about 
nucleotide 364; the fi-agment of SEQ ID NO: 16 from about nucleotide 100 to about 

30 nucleotide 138; the fragments of SEQ ID NO: 17 from about nucleotide 433 to about 

nucleotide 477 and from about nucleotide 1 1 89 to about nucleotide 1233; and the fragment 
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of SEQ ID NO: 1 8 from about nucleotide 973 to about nucleotide 1017. Polypeptides 
encoded by these fragments are useful, for example, as immunogenic peptides. 

The invention also encompasses HSCOP variants. A preferred HSCOP variant is 
one which has at least about 80%. more preferably at least about 90%, and most preferably 
5 at least about 95% amino acid sequence identity to the HSCOP amino acid sequence, and 
which contains at least one functional or structural characteristic of HSCOP. 

The invention also encompasses polynucleotides which encode HSCOP. In a 
particular embodiment, the invention encompasses a polynucleotide sequence comprising 
a sequence selected from the group consisting of SEQ ID NO: 10-18, which encodes 
10 HSCOP. 

The invention also encompasses a variant of a polynucleotide sequence encoding 
HSCOP. In particular, such a variant polynucleotide sequence will have at least about 
70%, more preferably at least about 85%, and most preferably at least about 95%o 
polynucleotide sequence identity to the polynucleotide sequence encoding HSCOP. A 

15 particular aspect of the invention encompasses a variant of a polynucleotide sequence 
comprising a sequence selected from the group consisting of SEQ ID NO: 10-1 8 which has 
at least about 70%, more preferably at least about 85%, and most preferably at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group 
consisting of SEQ ID NO: 10-1 8. Any one of the polynucleotide variants described above 

20 can encode an amino acid sequence which contains at least one functional or structural 
characteristic of HSCOP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of polynucleotide sequences encoding HSCOP, some bearing 
minimal similarity to the polynucleotide sequences of any known and naturally occurring 

25 gene, may be produced. Thus, the invention contemplates each and every possible 

variation of polynucleotide sequence that could be made by selecting combinations based 
on possible codon choices. These combinations are made in accordance with the standard 
triplet genetic code as applied to the polynucleotide sequence of naturally occurring 
HSCOP, and all such variations are to be considered as being specifically disclosed. 

30 Although nucleotide sequences which encode HSCOP and its variants are 

preferably capable of hybridizing to the nucleotide sequence of the naturally occurring 
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HSCOP under appropriately selected conditions of stringency, it may be advantageous to 
produce nucleotide sequences encoding HSCOP or its derivatives possessing a 
substantially different codon usage, e.g., inclusion of non-naturally occurring codons. 
Codons may be selected to increase the rate at which expression of the peptide occurs in a 

5 particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the 
nucleotide sequence encoding HSCOP and its derivatives without altering the encoded 
amino acid sequences include the production of RN A transcripts having more desirable 
properties, such as a greater half-life, than transcripts produced from the naturally 

10 occurring sequence. 

The invention also encompasses production of DNA sequences which encode 
HSCOP and HSCOP derivatives, or fragments thereof, entirely by synthetic chemistry. 
After production, the synthetic sequence may be inserted into any of the many available 
expression vectors and cell systems using reagents well known in the art. Moreover, 

15 synthetic chemistry may be used to introduce mutations into a sequence encoding HSCOP 
or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable 
of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown 
in SEQ ID NO: 10-1 8 and fragments thereof under various conditions of stringency. (See, 

20 e.g., Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. 
(1987) Methods Enzymol. 152:507-51 1.) For example, stringent salt concentration will 
ordinarily be less than about 750 mM NaCl and 75 mM trisodium citrate, preferably less 
than about 500 mM NaCl and 50 mM trisodium citrate, and most preferably less than 
about 250 mM NaCl and 25 mM trisodium citrate. Low stringency hybridization can be 

25 obtained in the absence of organic solvent, e.g., formamide, while high stringency 

hybridization can be obtained in the presence of at least about 35% formamide, and most 
preferably at least about 50% formamide. Stringent temperature conditions will ordinarily 
include temperatures of at least about 30°C, more preferably of at least about 37°C, and 
most preferably of at least about 42°C. Varying additional parameters, such as 

30 hybridization time, the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and 
the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. 
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Various levels of stringency are accomplished by combining these various conditions as 
needed. In a preferred embodiment, hybridization will occur at 30°C in 750 mM NaCl, 75 
mM trisodium citrate, and 1% SDS. In a more preferred embodiment, hybridization w^ill 
occur at 37°C in 500 mM NaCU 50 mM trisodium citrate. 1% SDS, 35% formamide, and 

5 100 Aig/ml denatured salmon sperm DNA (ssDNA). In a most preferred embodiment, 
hybridization will occur at 42''C in 250 mM NaCL 25 mM trisodium citrate, 1% SDS, 50 
% formamide, and 200 jug/ml ssDNA. Useful variations on these conditions will be 
readily apparent to those skilled in the art. 

The washing steps which follow hybridization can also vary in stringency. Wash 

10 stringency conditions can be defined by salt concentration and by temperature. As above, 
wash stringency can be increased by decreasing salt concentration or by increasing 
temperature. For example, stringent salt concentration for the wash steps will preferably 
be less than about 30 mM NaCl and 3 mM trisodium citrate, and most preferably less than 
about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent temperature conditions for 

15 the wash steps will ordinarily include temperature of at least about 25 ^C, more preferably 
of at least about 42^C, and most preferably of at least about 68°C. In a preferred 
embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3 mM trisodium citrate, and 
0.1% SDS, In a more preferred embodiment, wash steps will occur at 42^C in 15 mM 
NaCl, 1 .5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment, wash 

20 Steps will occur at 68°C in 15 mM NaCl, 1 .5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the 
art. 

Methods for DNA sequencing are well known in the art and may be used to 
practice any of the embodiments of the invention. The methods may employ such 

25 enzymes as the Klenow fragment of DNA polymerase I, SEQUENASE (US Biochemical, 
Cleveland OH), Taq polymerase (Perkin-Elmer), thermostable T7 polymerase (Amersham 
Phamiacia Biotech, Piscataway NJ), or combinations of polymerases and proofreading 
exonucleases such as those found in the ELONGASE amplification system (Life 
Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with 

30 machines such as the Hamilton MICROLAB 2200 (Hamilton, Reno NV), Peltier Thermal 
Cycler 200 (PTC200; MJ Research, Watertown MA) and the ABI CATALYST 800 
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(Perkin-Elmer). Sequencing is then carried out using the ABI 373 or 377 DNA 
sequencing systems (Perkin-Elmer), the MEG AB ACE 1 000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which are well known in the art. 
5 (See. e.g., Ausubel, P.M. (1997) Short Protocols in Molecular Biology . John Wiley & 
Sons. New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and 
Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding HSCOP may be extended utilizing a partial 
nucleotide sequence and employing various PCR-based methods known in the art to detect 
10 upstream sequences, such as promoters and regulatory elements. For example, one 

method which may be employed, restriction-site PCR. uses universal and nested primers 
to amplify unknown sequence from genomic DNA within a cloning vector. (See, e.g., 
Sarkar, G. (1993) PCR Methods AppHc. 2:318-322.) Another method, inverse PCR, uses 
primers that extend in divergent directions to amplify unknown sequence from a 
1 5 circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia. T. et al. (1988) 
Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves PCR amplification 
of DNA fragments adjacent to known sequences in human and yeast artificial chromosome 
DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-1 19.) In this 
20 method, multiple restriction enzyme digestions and ligations may be used to insert an 
engineered double-stranded sequence into a region of unknown sequence before 
performing PCR. Other methods which may be used to retrieve unknown sequences are 
known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-306). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 
25 (Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to 

screen libraries and is useful in finding intron/exon junctions. For all PCR-based methods, 
primers may be designed using commercially available software, such as OLIGO 4.06 
Primer Analysis software (National Biosciences, Plymouth MN) or another appropriate 
program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or 
30 more, and to anneal to the template at temperatures of about 6S'^C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have 
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been size-selected to include larger cDNAs. In addition, random-primed libraries, which 
often include sequences containing the 5' regions of genes, are preferable for situations in 
which an oligo d(T) library does not yield a full-length cDNA. Genomic libraries may be 
useful for extension of sequence into 5' non-transcribed regulatory regions. 

5 Capillary electrophoresis systems which are commercially available may be used 

to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In 
particular, capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different nucleotide-specific, laser-stimulated fluorescent dyes, and a 
charge coupled device camera for detection of the emitted wavelengths. Output/light 

10 intensity may be converted to electrical signal using appropriate software (e.g., 

GENOTYPER and SEQUENCE NAVIGATOR. Perkin-Elmer), and the entire process 
from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA 
fragments which may be present in limited amounts in a particular sample. 

1 5 In another embodiment of the invention, polynucleotide sequences or fragments 

thereof which encode HSCOP may be cloned in recombinant DNA molecules that direct 
expression of HSCOP, or fragments or functional equivalents thereof, in appropriate host 
cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid sequence may be 

20 produced and used to express HSCOP. 

The nucleotide sequences of the present invention can be engineered using 
methods generally known in the art in order to alter HSOCH-encoding sequences for a 
variety of purposes including, but not limited to, modification of the cloning, processing, 
and/or expression of the gene product. DNA shuffling by random fragmentation and PCR 

25 reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the 
nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis 
may be used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

In another embodiment, sequences encoding HSCOP may be synthesized, in whole 

30 or in part, using chemical methods well known in the art. (See, e.g., Caruthers. M,H. et al. 
(1980) Nuci. Acids Res. Symp. Ser. 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. 
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Symp. Ser. 225-232.) Alternatively, HSCOP itself or a fragment thereof may be 
synthesized using chemical methods. For example, peptide synthesis can be performed 
using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using the ABI 431 A Peptide 

5 Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of HSCOP. or any 
part thereof, may be altered during direct synthesis and/or combined with sequences from 
other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g, Chiez, R.M. and F.Z, Regnier (1990) Methods Enzymol. 

10 182:392-421 .) The composition of the synthetic peptides may be confirmed by amino acid 
analysis or by sequencing. (See, e.g., Creighton, T. (1984) Proteins, Structures and 
Molecular Properties . WH Freeman. New York NY.) 

In order to express a biologically active HSCOP, the nucleotide sequences 
encoding HSCOP or derivatives thereof may be inserted into an appropriate expression 

15 vector, i.e., a vector which contains the necessary elements for transcriptional and 

translational control of the inserted coding sequence in a suitable host. These elements 
include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 
5' and 3' untranslated regions in the vector and in polynucleotide sequences encoding 
HSCOP. Such elements may vary in their strength and specificity. Specific initiation 

20 signals may also be used to achieve more efficient translation of sequences encoding 
HSCOP. Such signals include the ATG initiation codon and adjacent sequences, e.g. the 
Kozak sequence. In cases where sequences encoding HSCOP and its initiation codon and 
upstream regulatory sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in 

25 cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be 
provided by the vector. Exogenous translational elements and initiation codons may be of 
various origins, both natural and synthetic. The efficiency of expression may be enhanced 
by the inclusion of enhancers appropriate for the particular host cell system used. (See, 

30 e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
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expression vectors containing sequences encoding HSCOP and appropriate transcriptional 
and translationai control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, 
J. et al. (1989) Molecular Clonings A Laboratory Manual . Cold Spring Harbor Press, 

5 Plainview NY, ch. 4, 8, and 16-17; AusubeK F.M. et al. (1995) Current Protocols in 
Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding HSCOP. These include, but are not limited to, microorganisms such 
as bacteria transformed v^ith recombinant bacteriophage, plasmid, or cosmid DNA 

10 expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
infected with viral expression vectors (e.g., baculovirus); plant cell systems transformed 
with viral expression vectors (e.g., cauliflower mosaic virus. CaMV, or tobacco mosaic 
virus,TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal 
cell systems. The invention is not limited by the host cell employed. 

15 In bacterial systems, a number of cloning and expression vectors may be selected 

depending upon the use intended for polynucleotide sequences encoding HSCOP. For 
example, routine cloning, subcloning, and propagation of polynucleotide sequences 
encoding HSCOP can be achieved using a multifunctional E. coli vector such as 
PBLUESCRIPT (Stratagene. La Jolla CA) or pSPORTl plasmid (Life Technologies). 

20 Ligation of sequences encoding HSCOP into the vector's multiple cloning site disrupts the 
lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in 
vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and 
creation of nested deletions in the cloned sequence. (See, e.g.. Van Heeke, G, and S.M. 

25 Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of HSCOP are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of 
HSCOP may be used. For example, vectors containing the strong, inducible T5 or T7 
bacteriophage promoter may be used. 

Yeast expression systems may be used for production of HSCOP, A number of 

30 vectors containing constitutive or inducible promoters, such as alpha factor, alcohol 

oxidase, and PGH, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . 
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In addition, such vectors direct either the secretion or intracellular retention of expressed 
proteins and enable integration of foreign sequences into the host genome for stable 
propagation. (See, e.g., Ausubel, 1995, supra : Grant et al. (1987) Methods Enzymol. 
153:516-54; and Scorer, C. A. et al. (1994) Bio/Technology 12:181-184.) 

5 Plant systems may also be used for expression of HSCOP, Transcription of 

sequences encoding HSCOP may be driven viral promoters, e.g., the 35S and 19S 
promoters of CaMV used alone or in combination with the omega leader sequence from 
TMV (Takamatsu, N. (1987) EMBO J. 6:307-3 1 1 ). Alternatively, plant promoters such as 
the small subunit of RUBISCO or heat shock promoters may be used, (See, e.g., Coruzzi, 

10 G. et al (1984) EMBO J. 3:1671-1680: Broglie, R. et aL (1984) Science 224:838-843; and 
Winter, J. et aL (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated 
transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology (1 992) 
McGraw Hill, New York NY, pp. 191-196.) 

15 In mammalian cells, a number of viral -based expression systems may be utilized. 

In cases where an adenovirus is used as an expression vector, sequences encoding HSCOP 
may be ligated into an adenovirus transcription/translation complex consisting of the late 
promoter and tripartite leader sequence. Insertion in a non-essential El or E3 region of the 
viral genome may be used to obtain infective virus which expresses HSCOP in host cells, 

20 (See, e.g., Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. 81 :3655-3659.) In 

addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 
used to increase expression in mammalian host cells. SV40 or EBV-based vectors may 
also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger 

25 fragments of DNA than can be contained in and expressed from a plasmid. HACs of about 
6 kb to 10 Mb are constructed and delivered via conventional delivery methods 
(liposomes, polycationic amino polymers, or vesicles) for therapeutic purposes. (See, e.g., 
Hanington, J.J. et aL (1997) Nat Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 

30 expression of HSCOP in cell lines is preferred. For example, sequences encoding HSCOP 
can be transformed into cell lines using expression vectors which may contain viral origins 
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of replication and/or endogenous expression elements and a selectable marker gene on the 
same or on a separate vector. Following the introduction of the vector, cells may be 
allowed to grow for about 1 to 2 days in enriched media before being switched to selective 
media. The purpose of the selectable marker is to confer resistance to a selective agent, 
5 and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated 
using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase and 
iO adenine phosphoribosyitransferase genes, for use in tk or apr cells, respectively. (See, 
e.g., Wigler, M. et al. (1977) Cell 1 1 :223-232: Lowy, I. et al. (1980) Cell 22:817-823.) 
Also, antimetabolite, antibiotic, or herbicide resistance can be used as the basis for 
selection. For example, dhfr confers resistance to methotrexate; neo confers resistance to 
the aminoglycosides, neomycin and G-41 8; and als or pat confer resistance to 
15 chiorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigier, M. et 
al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. 
Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD^ 
which alter cellular requirements for metabolites. (See, e.g., Hartman. S.C, and R.C. 
Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Visible markers, e.g., 
20 anthocyanins. green fluorescent proteins (GFP; Clontech), 6 glucuronidase and its 
substrate ft-glucuronide, or luciferase and its substrate luciferin may be used. These 
markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., 
Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 
25 Although the presence/absence of marker gene expression suggests that the gene of 

interest is also present, the presence and expression of the gene may need to be confirmed. 
For example, if the sequence encoding HSCOP is inserted within a marker gene sequence, 
transformed cells containing sequences encoding HSCOP can be identified by the absence 
of marker gene function. Alternatively, a marker gene can be placed in tandem with a 
30 sequence encoding HSCOP under the control of a single promoter. Expression of the 
marker gene in response to induction or selection usually indicates expression of the 
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tandem gene as well. 

In general host cells that contain the nucleic acid sequence encoding HSCOP and 
that express HSCOP may be identified by a variety of procedures known to those of skill 
in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
5 hybridizations, PGR amplification, and protein bioassay or immunoassay techniques 
which include membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of HSCOP 
using either specific polyclonal or monoclonal antibodies are known in the art. Examples 
10 of such techniques include enzyme-linked immunosorbent assays (ELlSAs), 

radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS). A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on HSCOP is preferred, but a competitive binding assay may be 
employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et al. 
15 (1990) Serological Methods, a Laboratory Manual , APS Press, St Paul MN, Sect. IV; 

Coligan, J. E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunoche mical Protocols, 
Humana Press, Totowa NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in 
20 the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides encoding HSCOP include oligolabeling, nick translation, end-labeling, or 
PCR amplification using a labeled nucleotide. Alternatively, the sequences encoding 
HSCOP, or any fragments thereof, may be cloned into a vector for the production of an 
25 mRNA probe. Such vectors are known in the art, are commercially available, and may be 
used to synthesize RNA probes in vitro by addition of an appropriate RN A polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using 
a variety of commercially available kits, such as those provided by Amersham Pharmacia 
Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter molecules or 
30 labels which may be used for ease of detection include radionuclides, enzymes, 

fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors. 
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inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HSCOP may be 
cultured under conditions suitable for the expression and recovery of the protein from cell 
culture. The protein produced by a transformed cell may be secreted or retained 

5 intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art. expression vectors containing polynucleotides which encode 
HSCOP may be designed to contain signal sequences which direct secretion of HSCOP 
through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression 

10 of the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosyiation. phosphorylation, lipidation, and acylation. Post-translationai processing 
which cleaves a ''prepro'' form of the protein may also be used to specify protein targeting, 
folding, and/or activity. Different host cells which have specific cellular machinery and 

15 characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, 
HEK293, and WI38), are available from the American Type Culture Collection (ATCC, 
Bethesda MD) and may be chosen to ensure the correct modification and processing of the 
foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic 
20 acid sequences encoding HSCOP may be ligated to a heterologous sequence resulting in 
translation of a fusion protein in any of the aforementioned host systems. For example, a 
chimeric HSCOP protein containing a heterologous moiety that can be recognized by a 
commercially available antibody may facilitate the screening of peptide libraries for 
inhibitors of HSCOP activity. Heterologous protein and peptide moieties may also 
25 facilitate purification of fusion proteins using commercially available affinity matrices. 
Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, 
FLAG, c-myc, and hemagglutinin (HA), GST, MBP, Trx. CBP, and 6-His enable 
purification of their cognate fusion proteins on immobilized glutathione, maltose, 
30 phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and 
hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using 
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commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic 
cleavage site located between the HSCOP encoding sequence and the heterologous protein 
sequence, so that HSCOP may be cleaved away from the heterologous moiety following 

5 purification. Methods for fusion protein expression and purification are discussed in 
Ausubel (1995. supra , ch 10). A variety of commercially available kits may also be used 
to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled HSCOP may be 
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 

10 (Promega). These systems couple transcription and translation of protein-coding 

sequences operably associated with the T7, T3, or SP6 promoters. Translation takes place 
in the presence of a radiolabeled amino acid precursor, preferably ^^S-methionine. 

Fragments of HSCOP may be produced not only by recombinant production, but 
also by direct peptide synthesis using solid-phase techniques. (See, e.g.^ Creighton. supra. 

15 pp, 55-60.) Protein synthesis may be performed by manual techniques or by automation. 
Automated synthesis may be achieved, for example, using the ABI 431 A Peptide 
Synthesizer (Perkin-Elmer). Various fi:agments of HSCOP may be synthesized separately 
and then combined to produce the full length molecule. 
THERAPEUTICS 

20 Chemical and structural similarity, e.g., in the context of sequences and motifs, 

exists between regions of HSCOP and human SOCS proteins. In addition, the expression 
of HSCOP is closely associated with cancer, inflammation and the immune response, and 
neurological tissues. Therefore, HSCOP appears to play a role in cancer, immune and 
neurological disorders, and infectious diseases. In the treatment of diseases or disorders 

25 associated with increased HSCOP expression or activity, it is desirable to decrease the 
expression or activity of HSCOP. In the treatment of the above conditions associated with 
decreased HSCOP expression or activity, it is desirable to increase the expression or 
activity of HSCOP. 

Therefore, in one embodiment, HSCOP or a fragment or derivative thereof may be 
30 administered to a subject to treat or prevent a disorder associated with decreased 

expression or activity of HSCOP. Examples of such disorders include, but are not limited 
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to, a cancer such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
5 spleen, testis, thymus, thyroid, and uterus; an immune disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress 
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes meilitus, 
10 emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 

erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves^ disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, 
15 rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, 
uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; a neurological disorder such as akathesia, Alzheimer's disease, amnesia, 
20 amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia, 
depression, diabetic neuropathy, Dovm's syndrome, tardive dyskinesia, dystonias, 
epilepsy, Huntington's disease, peripheral neuropathy, multiple sclerosis, 
neurofibromatosis, Parkinson's disease, paranoid psychoses, postherpetic neuralgia, 
schizophrenia, and Tourette's disorder; and an infectious disease such a viral infection, 
25 e.g., those caused by adenoviruses (acute respiratory disease, pneumonia), arenaviruses 
(lymphocytic choriomeningitis), bunyaviruses (Hantavirus), coronaviruses (pneumonia, 
chronic bronchitis), hepadnaviruses (hepatitis), herpesviruses (herpes simplex virus, 
varicella-zoster virus, Epstein-Barr virus, cytomegalovirus), flaviviruses (yellow fever), 
orthomyxoviruses (influenza), papillomaviruses (cancer), paramyxoviruses (measles, 
30 mumps), picomoviruses (rhinovirus, poliovirus, coxsackie-virus), polyomaviruses (BK 
virus, JC virus), poxviruses (smallpox), reovirus (Colorado tick fever), retroviruses 
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(human immunodeficiency virus, human T lymphotropic virus), rhabdoviruses (rabies), 
rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella), and bacteriaL fungal, 
parasitic, protozoal, and helminthic infections. 

In another embodiment, a vector capable of expressing HSCOP or a fragment or 
5 derivative thereof may be administered to a subject to treat or prevent a disorder associated 
with decreased expression or activity of HSCOP including, but not limited to, those 
described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified HSCOP in conjunction with a suitable pharmaceutical carrier may be 

10 administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of HSCOP including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of HSCOP 
may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of HSCOP including, but not limited to, those listed above. 

15 In a further embodiment, an antagonist of HSCOP may be administered to a 

subject to treat or prevent a disorder associated with increased expression or activity of 
HSCOP. Examples of such disorders include, but are not limited to, those described 
above. In one aspect, an antibody which specifically binds HSCOP may be used directly 
as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 

20 pharmaceutical agent to cells or tissue which express HSCOP. 

In an additional embodiment, a vector expressing the complement of the 
polynucleotide encoding HSCOP may be administered to a subject to treat or prevent a 
disorder associated with increased expression or activity of HSCOP including, but not 
limited to, those described above. 

25 In other embodiments, any of the proteins, antagonists, antibodies, agonists, 

complementary sequences, or vectors of the invention may be administered in combination 
with other appropriate therapeutic agents. Selection of the appropriate agents for use in 
combination therapy may be made by one of ordinary skill in the art, according to 
conventional pharmaceutical principles. The combination of therapeutic agents may act 

30 synergistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
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dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of HSCOP may be produced using methods which are generally 
known in the art. In particular, purified HSCOP may be used to produce antibodies or to 
screen libraries of pharmaceutical agents to identify those which specifically bind HSCOP. 
5 Antibodies to HSCOP may also be generated using methods that are well known in the art. 
Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and 
single chain antibodies, Fab fragments, and fragments produced by a Fab expression 
library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially 
preferred for therapeutic use. 
10 For the production of antibodies, various hosts including goats, rabbits, rats, mice, 

humans, and others may be immunized by injection with HSCOP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase immunological response. Such adjuvants 
include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and 
1 5 surface active substances such as ly solecithin, pluronic poly ols, polyanions, peptides, oil 
emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 
Calmette-Guerin) and Corvnebacteriimi parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to HSCOP have an amino acid sequence consisting of at least about 5 amino 
20 acids, and, more preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid 
sequence of the natural protein and contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of HSCOP amino acids may be fused with 
those of another protein, such as KLH, and antibodies to the chimeric molecule may be 
25 produced. 

Monoclonal antibodies to HSCOP may be prepared using any technique which 
provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique, (See, e.g., Kohler, G. et al. 
30 (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol, Methods 81 :31-42; 
Cote, RJ. et aL (1983) Proc. Natl. Acad. Sci. 80:2026-2030; and Cole, S.P. et al. (1984) 
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Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies/' such 
as the splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can be used. (See, e.g,, 
5 Morrison, S.L. et al. (1984) Proc. Natl. Acad. Sci. 81 :6851-6855; Neuberger, M.S. et al. 
(1984) Nature 3 12:604-608; and Takeda. S. et al. (1985) Nature 3 14:452-454.) 
Alternatively, techniques described for the production of single chain antibodies may be 
adapted, using methods known in the art, to produce HSOCH-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may 
10 be generated by chain shuffling from random combinatorial immunoglobulin libraries. 
(See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) 
15 Proc. Natl. Acad. Sci. 86: 3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for HSCOP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
produced by pepsin digestion of the antibody molecule and Fab fragments generated by 
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
20 libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. (See, e.g., Huse, W.D. et al. (1989) Science 
246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric 

25 assays using either polyclonal or monoclonal antibodies with established specificities are 
well known in the art. Such immunoassays typically involve the measurement of complex 
formation between HSCOP and its specific antibody. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering HSCOP 
epitopes is preferred, but a competitive binding assay may also be employed (Pound, 

30 supra) . 

Various methods such as Scatchard analysis in conjunction with 
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radioimmunoassay techniques may be used to assess the affinity of antibodies for HSCOP. 
Affinity is expressed as an association constant, K^, which is defined as the molar 
concentration of HSOCH-antibody complex divided by the molar concentrations office 
antigen and free antibody under equilibrium conditions. The determined for a 
5 preparation of polyclonal antibodies, which are heterogeneous in their affinities for 

multiple HSCOP epitopes, represents the average affinity, or avidity, of the antibodies for 
HSCOP. The determined for a preparation of monoclonal antibodies, which are 
monospecific for a particular HSCOP epitope, represents a true measure of affinity. High- 
affinity antibody preparations with ranging from about 10^ to 10^- L/mole are preferred 
10 for use in immunoassays in which the HSOCH-antibody complex must withstand rigorous 
manipulations. Low-affinity antibody preparations with ranging from about 10^ to 10^ 
L/mole are preferred for use in immunopurification and similar procedures which 
ultimately require dissociation of HSCOP, preferably in active form, from the antibody 
(Catty, D. (1988) Antibodies. Volume I: A Practical Approach , IRL Press, Washington, 
15 DC; Liddell, J, E. and Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies , John 
Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated 
to determine the quality and suitability of such preparations for certain downstream 
applications. For example, a polyclonal antibody preparation containing at least 1-2 mg 
20 specific antibody/ml, preferably 5-10 mg specific antibody/ml, is preferred for use in 
procedures requiring precipitation of HSOCH-antibody complexes. Procedures for 
evaluating antibody specificity, titer, and avidity, and guidelines for antibody quality and 
usage in various applications, are generally available. (See, e,g.. Catty, supra , and Coligan 
et al. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding HSCOP, or 

any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
the complement of the polynucleotide encoding HSCOP may be used in situations in 
which it would be desirable to block the transcription of the mRN A. In particular, cells 
may be transformed with sequences complementary to polynucleotides encoding HSCOP. 

30 Thus, complementary molecules or fragments may be used to modulate HSCOP activity, 
or to achieve regulation of gene fiinction. Such technology is now well known in the art, 
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and sense or antisense oligonucleotides or larger fragments can be designed from various 
locations along the coding or control regions of sequences encoding HSCOP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia 
viruses, or from various bacterial piasmids. may be used for delivery of nucleotide 

5 sequences to the targeted organ, tissue, or cell population. Methods which are well knovm 
to those skilled in the art can be used to construct vectors to express nucleic acid 
sequences complementary to the polynucleotides encoding HSCOP. (See, e.g., Sambrook, 
supra ; Ausubel, 1995, supra .) 

Genes encoding HSCOP can be turned off by transforming a cell or tissue with 

iO expression vectors which express high levels of a polynucleotide, or fragment thereof, 
encoding HSCOP, Such constructs may be used to introduce untranslatable sense or 
antisense sequences into a cell. Even in the absence of integration into the DNA, such 
vectors may continue to transcribe RNA molecules until they are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non-replicating 

1 5 vector, and may last even longer if appropriate replication elements are part of the vector 
system. 

As mentioned above, modifications of gene expression can be obtained by 
designing complementary sequences or antisense molecules (DNA, RNA, or PN A) to the 
control, 5\ or regulatory regions of the gene encoding HSCOP. Oligonucleotides derived 

20 from the transcription initiation site, e.g., between about positions -10 and +10 from the 
start site, are preferred. Similarly, inhibition can be achieved using triple helix 
base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the 
ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex 

25 DNA have been described in the literature, (See, e.g., Gee, J.E. et al. (1994) in Huber, 
B.E, and B.L Carr, Molecular and Immunologic Approaches. Futura Publishing, Mt. Kisco 
NY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to 
ribosomes. 

30 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific 

cleavage of RNA. The mechanism of ribozyme action involves sequence-specific 
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hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. For example, engineered hammerhead motif ribozyme 
molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences 
encoding HSCOP. 

5 Specific ribozyme cleavage sites within any potential RNA target are initially 

identified by scanning the target molecule for ribozyme cleavage sites, including the 
following sequences: GUA, GUU. and GUC. Once identified, short RNA sequences of 
between 15 and 20 ribonucleotides, corresponding to the region of the target gene 
containing the cleavage site, may be evaluated for secondary structural features which may 

10 render the oligonucleotide inoperable. The suitability of candidate targets may also be 
evaluated by testing accessibility to hybridization with complementary oligonucleotides 
using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. 

15 These include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by 
in vitro and in vivo transcription of DNA sequences encoding HSCOP. Such DNA 
sequences may be incorporated into a wide variety of vectors v^th suitable RNA 
polymerase promoters such as T7 or SP6, Alternatively, these cDNA constructs that 

20 synthesize complementary RNA, constitutively or inducibly, can be introduced into cell 

lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. 
Possible modifications include, but are not limited to, the addition of flanking sequences at 
the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 1' O-methyl rather 

25 than phosphodiesterase linkages within the backbone of the molecule. This concept is 
inherent in the production of PNAs and can be extended in all of these molecules by the 
inclusion of nontraditional bases such as inosine, queosine. and wybutosine, as well as 
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

30 Many methods for introducing vectors into cells or tissues are available and 

equally suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be 
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introduced into stem cells taken from the patient and clonally propagated for autologous 
transplant back into that same patient. Delivery by transfection, by liposome injections, or 
by polycationic amino polymers may be achieved using methods which are well knovm in 
the art. (See, e.g., Goldman, C.K, et aL (1997) Nature Biotechnology 15:462-466.) 
5 Any of the therapeutic methods described above may be applied to any subject in 

need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, 
rabbits, monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable 
1 0 carrier, for any of the therapeutic effects discussed above. Such pharmaceutical 
compositions may consist of HSCOP, antibodies to HSCOP, and mimetics. agonists, 
antagonists, or inhibitors of HSCOP. The compositions may be administered alone or in 
combination with at least one other agent, such as a stabilizing compound, which may be 
administered in any sterile, biocompatible pharmaceutical carrier including, but not limited 
15 to, saline, buffered saline, dextrose, and water. The compositions may be administered to a 
patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by 
any number of routes including, but not limited to, orai, intravenous, intramuscular, 
intra-arterial. intramedullary, intrathecal intraventricular, transdermal subcutaneous, 
20 intraperitoneal, intranasal, enteral, topical sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration may 
25 be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing, 
Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
30 tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for 
ingestion by the patient. 
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Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with soUd excipient and processing the resultant mixture of granules 
(optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be 
added, if desired. Suitable excipients include carbohydrate or protein fillers, such as 
5 sugars, including lactose, sucrose, mannitoL and sorbitol; starch from com, wheat, rice, 
potato, or other plants; cellulose, such as methyl cellulose, 

hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including 
arable and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating 
or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, 

10 and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as 
concentrated sugar solutions, which may also contain gum arable, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures, Dyestuffs or pigments may 

15 be added to the tablets or dragee coatings for product identification or to characterize the 
quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or 

20 binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or 
suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with 

or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be 

25 formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks* solution. Ringer's solution, or physiologically buffered saline. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the 
active compounds may be prepared as appropriate oily injection suspensions. Suitable 

30 lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid 
esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino 
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polymers may also be used for delivery. Optionally, the suspension may also contain 
suitable stabilizers or agents to increase the solubility of the compounds and allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier 
5 to be permeated are used in the formulation. Such penetrants are generally known in the 
art. 

The pharmaceutical compositions of the present invention may be manufactured in 
a manner that is known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 

10 lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with 
many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, and succinic acid. Salts tend to be more soluble in aqueous or other protonic 
solvents than are the corresponding free base forms. In other cases, the preferred 

15 preparation may be a lyophilized powder which may contain any or all of the following: 1 
mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitoi, at a pH range of 4.5 
to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For 

20 administration of HSCOP, such labeling would include amount, frequency, and method of 
administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those 

25 skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, 
rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate 
concentration range and route of administration. Such information can then be used to 
30 determine useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for 
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example HSCOP or fragments thereof, antibodies of HSCOP. and agonists, antagonists or 
inhibitors of HSCOP, which ameliorates the symptoms or condition. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in cell cultures or 
with experimental animals, such as by calculating the ED50 (the dose therapeutically 
5 effective in 50% of the population) or LD50 (the dose lethal to 50% of the population) 
statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, and it can 
be expressed as the LD50/ED50 ratio. Pharmaceutical compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in 
1 0 such compositions is preferably within a range of circulating concentrations that includes 
the ED50 with little or no toxicity. The dosage varies within this range depending upon the 
dosage form employed, the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related 
to the subject requiring treatment. Dosage and administration are adjusted to provide 
1 5 sufficient levels of the active moiety or to maintain the desired effect. Factors which may 
be taken into account include the severity of the disease state, the general health of the 
subject, the age, weight, and gender of the subject, time and frequency of administration, 
drug combination(s), reaction sensitivities, and response to therapy. Long-acting 
pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
20 biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 to 100,000 Aig, up to a total 
dose of about 1 gram, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature and generally 
available to practitioners in the art. Those skilled in the art will employ different 
25 formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of 
polynucleotides or polypeptides will be specific to particular cells, conditions, locations, 
etc. 

DIAGNOSTICS 

In another embodiment, antibodies which specifically bind HSCOP may be used 
30 for the diagnosis of disorders characterized by expression of HSCOP, or in assays to 
monitor patients being treated with HSCOP or agonists, antagonists, or inhibitors of 
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HSCOP, Antibodies useful for diagnostic purposes may be prepared in the same manner 
as described above for therapeutics. Diagnostic assays for HSCOP include methods which 
utilize the antibody and a label to detect HSCOP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be 
5 labeled by covaient or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be 
used. 

A variety of protocols for measuring HSCOP. including ELISAs. RIAs, and FACS, 
are known in the art and provide a basis for diagnosing altered or abnormal levels of 
10 HSCOP expression. Normal or standard values for HSCOP expression are established by 
combining body fluids or cell extracts taken from normal mammalian subjects, preferably 
human, with antibody to HSCOP under conditions suitable for complex formation. The 
amount of standard complex formation may be quantitated by various methods, preferably 
by photometric means. Quantities of HSCOP expressed in subject, control, and disease 
15 samples from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HSCOP 
may be used for diagnostic purposes. The polynucleotides which may be used include 
oligonucleotide sequences, complementar>^ RNA and DNA molecules, and PNAs. The 
20 polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in 
which expression of HSCOP may be correlated with disease. The diagnostic assay may be 
used to determine absence, presence, and excess expression of HSCOP, and to monitor 
regulation of HSCOP levels during therapeutic intervention. 

In one aspect hybridization with PCR probes which are capable of detecting 
25 polynucleotide sequences, including genomic sequences, encoding HSCOP or closely 
related molecules may be used to identify nucleic acid sequences which encode HSCOP. 
The specificity of the probe, whether it is made from a highly specific region, e.g., the 5' 
regulatory region, or from a less specific region, e.g., a conserved motif, and the 
stringency of the hybridization or amplification (maximal, high, intermediate, or low), will 
30 determine whether the probe identifies only naturally occurring sequences encoding 
HSCOP, allelic variants, or related sequences. 
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Probes may also be used for the detection of reiated sequences, and should 
preferably have at least 50% sequence identity to any of the HSCOP encoding sequences. 
The hybridization probes of the subject invention may be DNA or RNA and may be 
derived from the sequence of SEQ ID NO: 10-1 8 or from genomic sequences including 
5 promoters, enhancers, and introns of the HSCOP gene. 

Means for producing specific hybridization probes for DNAs encoding HSCOP 
include the cloning of polynucleotide sequences encoding HSCOP or HSCOP derivatives 
into vectors for the production of mRNA probes. Such vectors are known in the art. are 
commercially available, and may be used to synthesize RNA probes in vitro by means of 
10 the addition of the appropriate RNA polymerases and the appropriate labeled nucleotides. 
Hybridization probes may be labeled by a variety of reporter groups, for example, by 
radionuclides such as ^"P or ^^S. or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding HSCOP may be used for the diagnosis of 
15 disorders associated with expression of HSCOP. Examples of such disorders include, but 
are not limited to, a cancer such as adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal 
tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
20 salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immune disorder such 
as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
25 diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 
Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
30 polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's 
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis. 
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thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of 
cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, 
protozoal, and helminthic infections, and trauma; a neurological disorder such as 
akathesia, Alzheimer's disease, amnesia, amyotrophic lateral sclerosis, bipolar disorder, 
5 catatonia, cerebral neoplasms, dementia, depression, diabetic neuropathy, Dovm's 
syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's disease, peripheral 
neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid 
psychoses, postherpetic neuralgia, schizophrenia, and Tourette's disorder; and an 
infectious disease such a viral infection, e.g., those caused by adenoviruses (acute 
10 respiratory disease, pneumonia), arenaviruses (lymphocytic choriomeningitis), 

bunyaviruses (Hantavirus), coronaviruses (pneumonia, chronic bronchitis), hepadnaviruses 
(hepatitis), herpesviruses (herpes simplex virus, varicella-zoster virus, Epstein-Barr virus, 
cytomegalovirus), flaviviruses (yellow fever), orthomyxoviruses (influenza), 
papillomaviruses (cancer), paramyxoviruses (measles, mumps), picomoviruses 
15 (rhinovirus, poliovirus, coxsackie-virus), polyomaviruses (BK virus, JC virus), poxviruses 
(smallpox), reovirus (Colorado tick fever), retroviruses (human immunodeficiency virus, 
human T lymphotropic virus), rhabdoviruses (rabies), rotaviruses (gastroenteritis), and 
togaviruses (encephalitis, rubella), and bacterial, fungal, parasitic, protozoal, and 
helminthic infections. The polynucleotide sequences encoding HSCOP may be used in 
20 Southern or northern analysis, dot blot, or other membrane -based technologies; in PGR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays 
utilizing fluids or tissues from patients to detect altered HSCOP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HSCOP may be useful in 
25 assays that detect the presence of associated disorders, particularly those mentioned above. 
The nucleotide sequences encoding HSCOP may be labeled by standard methods and 
added to a fluid or tissue sample from a patient under conditions suitable for the formation 
of hybridization complexes. After a suitable incubation period, the sample is washed and 
the signal is quantitated and compared with a standard value. If the amount of signal in 
30 the patient sample is significantly altered in comparison to a control sample then the 
presence of altered levels of nucleotide sequences encoding HSCOP in the sample 
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indicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in 
clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with 
5 expression of HSCOP, a normal or standard profile for expression is established. This 
may be accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence, or a fragment thereof, encoding 
HSCOP, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects 
10 with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this maimer may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. 
Deviation from standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
15 hybridization assays may be repeated on a regular basis to determine if the level of 
expression in the patient begins to approximate that which is observed in the normal 
subject. The results obtained from successive assays may be used to show the efficacy of 
treatment over a period ranging from several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either 
20 under- or overexpressed) in biopsied tissue from an individual may indicate a 

predisposition for the development of the disease, or may provide a means for detecting 
the disease prior to the appearance of actual clinical symptoms. A more definitive 
diagnosis of this type may allow health professionals to employ preventative measures or 
aggressive treatment earlier thereby preventing the development or further progression of 
25 the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences 
encoding HSCOP may involve the use of PGR. These oligomers may be chemically 
synthesized, generated enzymatically, or produced in vitro . Oligomers will preferably 
contain a fragment of a polynucleotide encoding HSCOP, or a fragment of a 
30 polynucleotide complementary to the polynucleotide encoding HSCOP, and will be 
employed under optimized conditions for identification of a specific gene or condition. 
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Oligomers may also be employed under less stringent conditions for detection or 
quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of HSCOP include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 

5 interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. 

ImmunoL Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) 
The speed of quantitation of multiple samples may be accelerated by running the assay in 
an ELISA format where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

10 In further embodiments, oligonucleotides or longer fragments derived from any of 

the polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes 
simultaneously and to identify genetic variants, mutations, and polymorphisms. This 
information may be used to determine gene function, to understand the genetic basis of a 

1 5 disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. 
(See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al (1996) 
Proc. Natl. Acad. Sci. 93:10614-10619; Baldeschweiler et al (1995) PCT application 
20 W095/251 116; Shaion, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. 
(1997) Proc. Natl. Acad, Sci. 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 
5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding HSCOP 
may be used to generate hybridization probes useful in mapping the naturally occurring 
25 genomic sequence. The sequences may be mapped to a particular chromosome, to a 
specific region of a chromosome, or to artificial chromosome constructions, e.g., human 
artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial 
chromosomes (BACs), bacterial PI constructions, or single chromosome cDNA libraries. 
(See, e.g., Harrington, J.J. et al. (1997) Nat Genet. 15:345-355; Price, CM. (1993) Blood 
30 Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
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chromosome mapping techniques and genetic map data. (See. e.g., Heinz-Uirich. et al. 
(1995) in Meyers, supra , pp. 965-968.) Examples of genetic map data can be found in 
various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) site. 
Correlation between the location of the gene encoding HSCOP on a physical chromosomal 
5 map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder. The nucleotide sequences of the invention 
may be used to detect differences in gene sequences among normal, carrier, and affected 
individuals. 

In situ hybridization of chromosomal preparations and physical mapping 
1 0 techniques, such as linkage analysis using established chromosomal markers, may be used 
for extending genetic maps. Often the placement of a gene on the chromosome of another 
mammalian species, such as mouse, may reveal associated markers even if the number or 
arm of a particular human chromosome is not knovm. New sequences can be assigned to 
chromosomal arms by physical mapping. This provides valuable information to 
1 1> investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the disease or syndrome has been crudely localized by genetic linkage to 
a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. 
(See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the 
20 subject invention may also be used to detect differences in the chromosomal location due 
to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, HSCOP, its catalytic or immunogenic 
fragments, or oligopeptides thereof can be used for screening libraries of compounds in 
any of a variety of drug screening techniques. The fragment employed in such screening 
25 may be free in solution, affixed to a solid support, borne on a cell surface, or located 

intracellularly. The formation of binding complexes between HSCOP and the agent being 
tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, 
30 et al. (1984) PCT application WO84/03564.) In this method, large numbers of different 
small test compounds are synthesized on a solid substrate. The test compounds are reacted 
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with HSCOP, or fragments thereof, and washed. Bound HSCOP is then detected by 
methods well known in the art. Purified HSCOP can also be coated directly onto plates 
for use in the aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid support, 

5 In another embodiment, one may use competitive drug screening assays in which 

neutralizing antibodies capable of binding HSCOP specifically compete with a test 
compound for binding HSCOP. In this manner, antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with HSCOP. 
In additional embodiments, the nucleotide sequences which encode HSCOP may 

10 be used in any molecular biology techniques that have yet to be developed, provided the 
new techniques rely on properties of nucleotide sequences that are currently known, 
including, but not limited to, such properties as the triplet genetic code and specific base 
pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
1 5 preceding description, utilize the present invention to its fullest extent. The following 
preferred specific embodiments are, therefore, to be construed as merely illustrative, and 
not limitative of the remainder of the disclosure in any was whatsoever. 

The entire disclosure of ail applications, patents, and publications, cited above and 
below, and of US provisional applications 60/087,104 (filed May 28. 1998), and 
20 09/216,006 (filed December 17. 1998) are hereby incorporated by reference. 

EXAMPLES 
I. Construction of cDN A Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. 
Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others 
25 were homogenized and lysed in phenol or in a suitable mixture of denaturants. such as 
TRIZOL (Life Technologies), a monophasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 
with chloroform, RNA was precipitated from the lysates with either isopropanol or 
sodium acetate and ethanol or by other routine methods. 
30 Phenol extraction and precipitation of RNA were repeated as necessary to increase 

RNA purity. In some cases. RNA was treated with DNase. For most libraries, poly(A+) 
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RNA was isolated using oligo d(T)-coupled paramagnetic particles (Promega), 
OLIGOTEX latex particles (QIAGEN, Chatsworth CA), or an OLIGOTEX mRN A 
purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates 
using other RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, 
5 Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the 
corresponding cDNA libraries. Otherwise. cDNA was synthesized and cDNA libraries 
were constructed with the UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid 
system (Life Technologies), using the recommended procedures or similar methods known 
10 in the art. (See, e.g., AusubeU 1997, supra , units 5.1-6.6). Reverse transcription was 
initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were 
ligated to double stranded cDNA, and the cDNA was digested with the appropriate 
restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SI 000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
15 chromatography (Amersham Pharmacia Biotech) or preparative agarose gel 

electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable piasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORTl 
plasmid (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA), 
Recombinant plasmids were transformed into competent E. coli cells including XL 1 -Blue, 
20 XLI-BlueMRF, or SOLR from Stratagene or DH5a, DHIOB, or ElectroMAX DHIOB 
from Life Technologies. 
IL Isolation of cDN A Clones 

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP 
vector system (Stratagene) or cell lysis. Plasmids were purified using at least one of the 
25 following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 

AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 
Plasmid, QIAWELL 8 Pius Plasmid, QIAWELL 8 Ultra Plasmid purification systems or 
the REAL Prep 96 plasmid kh from QIAGEN. Following precipitation, plasmids were 
resuspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4°C. 
30 Alternatively, plasmid DNA was amplified from host cell lysates using direct link 

PGR in a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216: 1-14). Host cell 
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lysis and thermal cycling steps were carried out in a single reaction mixture. Samples 
were processed and stored in 384-well plates, and the concentration of amplified plasmid 
DNA was quantified fluorometricaliy using PICOGREEN dye (Molecular Probes, Eugene 
OR) and a Fluoroskan II fluorescence scanner (Labsystems Oy. Helsinki, Finland). 

5 III. Sequencing and Analysis 

cDNA sequencing reactions were processed using standard methods or high- 
throughput instrumentation such as the ABI CATALYST 800 (Perkin-EImer) thermal 
cycler or the PTC-200 thermal cycler (MJ Research) in conjunction with the HYDRA 
microdispenser (Robbins Scientific) or the MICROLAB 2200 (Hamilton) liquid transfer 

\0 system. cDNA sequencing reactions were prepared using reagents provided by Amersham 
Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI PRISM BIGDYE 
Terminator cycle sequencing ready reaction kit (Perkin-Elmer). Electrophoretic separation 
of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 

15 PRISM 373 or 377 sequencing systems (Perkin-Elmer) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. 
Reading frames within the cDNA sequences were identified using standard methods 
(reviewed in Ausubel, 1997, supra , imit 7.7). Some of the cDNA sequences were selected 
for extension using the techniques disclosed in Example V. 

20 The polynucleotide sequences derived from cDNA sequencing were assembled and 

analyzed using a combination of software programs which utilize algorithms well known 
to those skilled in the art. Table 5 summarizes the tools, programs, and algorithms used 
and provides applicable descriptions, references, and threshold parameters. The first 
column of Table 5 shows the tools, programs, and algorithms used, the second column 

25 provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to 
evaluate the strength of a match between two sequences (the higher the score, the greater 
the homology between two sequences). Sequences were analyzed using MACDNASIS 

30 PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASERGENE software (DNASTAR). 
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The polynucleotide sequences were validated by removing vector, linker, and 
polyA sequences and by masking ambiguous bases, using algorithms and programs based 
on BLAST, dynamic programing, and dinucleotide nearest neighbor analysis. The 
sequences were then queried against a selection of public databases such as GenBank 
5 primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS to acquire 
annotation, using programs based on BLAST, FASTA, and BLIMPS. The sequences were 
assembled into full length polynucleotide sequences using programs based on Phred, 
Phrap, and Consed, and were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were 
10 translated to derive the corresponding full length amino acid sequences, and these full 
C3 length sequences were subsequently analyzed by querying against databases such as the 

J GenBank databases (described above), SwissProt, BLOCKS, PRINTS, Prosite, and 

Iff Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 

probabilistic approach which analyzes consensus primary structures of gene families, 
rij 15 (See, e.g., Eddy, S.R. (1996) Curr. Opin. Str. Biol. 6:361-365.) 

;U The programs described above for the assembly and analysis of fiill length 

'"=^ polynucleotide and amino acid sequences were also used to identify polynucleotide 

in sequence fragments from SEQ ID NO: 10-1 8. Fragmerrts from about 20 to about 4000 

J'J nucleotides which are useful in hybridization and amplification technologies were 

20 described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a 
transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a 
membrane on which RNAs from a particular cell type or tissue have been bound. (See, 
25 e.g., Sambrook, supra , ch, 7; Ausubel, 1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical 
or related molecules in nucleotide databases such as GenBank or LIFESEQ database 
(Incyte Pharmaceuticals). This analysis is much faster than multiple membrane-based 
hybridizations. In addition, the sensitivity of the computer search can be modified to 
30 determine whether any particular match is categorized as exact or similar. The basis of the 
search is the product score, which is defined as: 
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% sequence identity x % maximum BLAST score 

100 

The product score takes into account both the degree of similarity between two sequences 
and the length of the sequence match. For example, with a product score of 40, the match 

5 will be exact within a 1% to 2% error, and. with a product score of 70. the match will be 
exact. Similar molecules are usually identified by selecting those which show product 
scores between 1 5 and 40. although lower scores may identify related molecules. 

The results of northern analyses are reported as a percentage distribution of 
libraries in which the transcript encoding HSCOP occurred. Analysis involved the 

10 categorization of cDNA libraries by organ/tissue and disease, disorder, or condition. The 
organ/tissue categories included cardiovascular, dermatologic, developmental, endocrine, 
gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, reproductive, and 
urologic. The disease/disorder/condition categories included cancer, inflammation/trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries 

1 5 expressing the sequence of interest was counted and divided by the total number of 

libraries across all categories. Percentage values of tissue-specific and disease-, disorder-, 
or condition-specific expression are reported in Table 3. 



V. Extension of HSCOP Encoding Polynucleotides 

20 The fall length nucleic acid sequences of SEQ ID NO: 10-18 were produced by 

extension of an appropriate fragment of the full length molecule using oligonucleotide 
primers designed from this fragment. One primer was synthesized to initiate 5' extension 
of the known fragment, and the other primer, to initiate 3' extension of the known 
fragment. The initial primers were designed using OLIGO 4.06 software (National 

25 Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in length, 
to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68''C to about 72 °C. Any stretch of nucleotides which would result 
in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than 

30 one extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the 
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art. PGR was performed in 96-well plates using the PTC-200 thermal cycler (MJ 
Research, Inc.). The reaction mix contained DNA template, 200 nmol of each primer, 
reaction buffer containing Mg-\ (NH4)2S04, and p-mercaptoethanol, Taq DNA 
polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life Technologies), 
and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair PCI 
A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 
68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: 
storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as 
follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3. and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 
4°C. 

The concentration of DNA in each well was determined by dispensing 100 ^l 
PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene 
OR) dissolved in IX TE and 0.5 ^il of undiluted PGR product into each well of an opaque 
fiuorimeter plate (Coming Costar, Acton MA), allowing the DNA to bind to the reagent. 
The plate was scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure 
the fluorescence of the sample and to quantify the concentration of DNA. A 5 //I to 10 ^1 
aliquot of the reaction mixture was analyzed by electrophoresis on a 1 % agarose mini-gel 
to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well 
plates, digested with CviJI cholera virus endonuclease (Molecular Biology Research, 
Madison WI), and sonicated or sheared prior to religation into pUC 1 8 vector (Amersham 
Pharmacia Biotech). For shotgun sequencing, the digested nucleotides were separated on 
low concentration (0.6 to 0.8%) agarose gels, fragments were excised, and agar digested 
i with Agar ACE (Promega). Extended clones were religated using T4 ligase (New England 
Biolabs, Beverly MA) into pUC 18 vector (Amersham Pharmacia Biotech), treated with 
Pfu DNA polymerase (Stratagene) to fiU-in restriction site overhangs, and transfected into 
competent E. coli cells. Transformed cells were selected on antibiotic-containing media, 
individual colonies were picked and cultured overnight at 37° C in 384-well plates in 

0 LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PGR using Taq DNA 
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polymerase (Amersham Phannacia Biotech) and Pfu DNA polymerase (Stratagene) with 
the following parameters: Step 1: 94 °C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; 
Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72 °C, 5 min; Step 
7: storage at 4°C. DNA was quantified by PICOGREEN reagent (Molecular Probes) as 
5 described above. Samples with low DNA recoveries were reamplified using the same 
conditions as described above. Samples were diluted with 20% dimethysulphoxide (1:2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the 
DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABl PRISM BIGDYE 
Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 
10 In like manner, the nucleotide sequences of SEQ ID NO:10-18 are used to obtain 

5' regulatory sequences using the procedure above, oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VL Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 10-1 8 are employed to screen 
1 5 cDN As, genomic DNAs, or mRN As. Although the labeling of oligonucleotides, 

consisting of about 20 base pairs, is specifically described, essentially the same procedure 
is used with larger nucleotide fragments. Oligonucleotides are designed using state-of-the- 
art software such as OLIGO 4.06 software (National Biosciences) and labeled by 
combining 50 pmol of each oligomer, 250 /uCi of [7-^^P] adenosine triphosphate 
20 (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 
superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot 
containing 10^ counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following 
25 endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xbal , or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred 
to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is 
carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially 
washed at room temperature under increasingly stringent conditions up to 0. 1 x saline 
30 sodium citrate and 0.5% sodium dodecyl sulfate. Hybridization patterns are visualized 
using autoradiography and compared. 
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VIL Microarrays 

A chemical coupling procedure and an ink: jet device can be used to synthesize 
array elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra .) An array 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface 

5 of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical 
array may be produced by hand or using available methods and machines and contain any 
appropriate number of elements. After hybridization, nonhybridized probes are removed 
and a scaimer used to determine the levels and patterns of fluorescence. The degree of 
complementarity and the relative abundance of each probe which hybridizes to an element 

10 on the microarray may be assessed through analysis of the scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
comprise the elements of the microarray. Fragments suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software 
(DNASTAR). Full-length cDNAs, ESTs, or fragments thereof corresponding to one of the 

1 5 nucleotide sequences of the present invention, or selected at random from a cDNA library 
relevant to the present invention, are arranged on an appropriate substrate, e.g., a glass 
slide. TTie cDNA is fixed to the slide using, e.g., UV cross-linking followed by thermal 
and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. (1995) 
Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent 

20 probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 
Vllh Complementary Polynucleotides 

Sequences complementary to the HSOCH-encoding sequences, or any parts 
thereof, are used to detect, decrease, or inhibit expression of naturally occurring HSCOP. 

25 Although use of oligonucleotides comprising from about 15 to 30 base pairs is described, 
essentially the same procedure is used with smaller or with larger sequence fragments. 
Appropriate oligonucleotides are designed using OLIGO 4.06 software (National 
Biosciences) and the coding sequence of HSCOP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5* sequence and used to 

30 prevent promoter binding to the coding sequence. To inhibit translation, a complementary 
oligonucleotide is designed to prevent ribosomal binding to the HSOCH-encoding 
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transcript. 

IX, Expression of HSCOP 

Expression and purification of HSCOP is achieved using bacterial or virus-based 
expression systems. For expression of HSCOP in bacteria, cDNA is subcloned into an 

5 appropriate vector containing an antibiotic resistance gene and an inducible promoter that 
directs high levels of cDNA transcription. Examples of such promoters include, but are 
not limited to, the trp-lac (tac) hybrid promoter and the T5 or T7 bacteriophage promoter 
in conjunction with the lac operator regulatory element. Recombinant vectors are 
transformed into suitable bacterial hosts, e.g*, BL21(DE3), Antibiotic resistant bacteria 

10 express HSCOP upon induction with isopropyi beta-D-thiogalactopyranoside (IPTG). 
Expression of HSCOP in eukaryotic cells is achieved by infecting insect or mammalian 
cell lines with recombinant Autogranhica califomica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of 
baculovirus is replaced with cDNA encoding HSCOP by either homologous 

1 5 recombination or bacterial-mediated transposition involving transfer plasmid 

intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives 
high levels of cDN A transcription. Recombinant baculovirus is used to infect Spodoptera 
frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. Infection 
of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E. 

20 K. et aL (1994) Proc. Nati. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. 
Gene Ther. 7:1937-19450 

In most expression systems, HSCOP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, 
permitting rapid, single-step, affinity-based purification of recombinant fiision protein 

25 from crude cell lysates. GST, a 26-kilodalton enzyme from Schistosoma ianonicum . 
enables the purification of fusion proteins on immobilized glutathione under conditions 
that maintain protein activity and antigenicity (Amersham Pharmacia Biotech), Following 
purification, the GST moiety can be proteolyticaily cleaved from HSCOP at specifically 
engineered sites, FLAG, an 8-amino acid peptide, enables immunoaffinity purification 

30 using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on 
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metal-chelate resins (QIAGEN). Methods for protein expression and purification are 
discussed in Ausubel (1995, supra, ch 10 and 16). Purified HSCOP obtained by these 
methods can be used directly in the following activity assay. 
X. Demonstration of HSCOP Activity 
5 HSCOP activity is demonstrated by the inhibition of differentiation in murine Ml 

cells transfected with the gene expressing HSCOP and induced to differentiate by 
treatment with IL-6 (Starr et aL, supra) . Differentiation is measured in the parent Ml cell 
line and in Ml cells transfected with HSCOP by the appearance of differentiated colonies 
arising from cells grown in semi-soft agar culture. The percent inhibition of 
10 differentiation in Ml transfected cells compared to the parent Ml cell line is proportional 

13 to the activity of HSCOP in the former cells. 

Q XI. Functional Assays 

J 7 HSCOP function is assessed by expressing the sequences encoding HSCOP at 

fU physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned 

ifll 1 5 into a mammalian expression vector containing a strong promoter that drives high levels 

J:^ of cDNA expression. Vectors of choice include pCMV SPORT (Life Technologies) and 

SI pCR3, 1 (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter, 

III 5-10 yug of recombinant vector are transiently transfected into a human cell line, preferably 

of endothelial or hematopoietic origin, using either liposome formulations or 
20 electroporation. 1-2 jug of an additional plasmid containing sequences encoding a marker 
protein are co-transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression 
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent 
Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), 
25 an automated, laser optics-based technique, is used to identify transfected cells expressing 
GFP or CD64-GFP and to evaluate properties, for example, their apoptotic state. FCM 
detects and quantifies the uptake of fluorescent molecules that diagnose events preceding 
or coincident with cell death. These events include changes in nuclear DNA content as 
measured by staining of DNA with propidium iodide; changes in cell size and granularity 
30 as measured by forward Ught scatter and 90 degree side iight scatter; down-regulation of 
DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
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expression of cell surface and intracellular proteins as measured by reactivity with specific 
antibodies; and alterations in plasma membrane composition as measured by the binding 
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow 
cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry , Oxford, New York 
5 NY. 

The influence of HSCOP on gene expression can be assessed using highly 
purified populations of cells transfected with sequences encoding HSCOP and either 
CD64 or CD64-GFP. CD64 and CD64-GFP are expressed on the surface of transfected ' 
cells and bind to conserved regions of human immunoglobulin G (IgG). Transfected cells 

10 are efficiently separated from nontransfected ceils using magnetic beads coated with either 
human IgG or antibody against CD64 (DYNAL, Lake Success NY). mRNA can be 
purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding HSCOP and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 

1 5 XII. Production of HSCOP Specific Antibodies 

HSCOP substantially purified using polyacrylamide gel electrophoresis (PAGE; 
see, e.g,, Harrington, M.G. (1990) Methods EnzymoL 182:488-495), or other purification 
techniques, is used to immunize rabbits and to produce antibodies using standard 
protocols. 

20 Alternatively, the HSCOP amino acid sequence is analyzed using LASERGENE 

software (DNASTAR) to determine regions of high immunogenicity. and a corresponding 
oligopeptide is synthesized and used to raise antibodies by means known to those of skill 
in the art. Methods for selection of appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions are well described in the art. (See, e.g., Ausubel, 

25 1 995, supra , ch, 1 1 ,) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431 A 
Peptide Synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma- 
Aldrich. St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide 
ester (MBS) to increase immunogenicity. (See, e.g., Ausubel, 1995, supra .") Rabbits are 

30 immunized with the oligopeptide-KLH complex in complete Freund's adjuvant. Resulting 
antisera are tested for antipeptide activity by, for example, binding the peptide to plastic, 
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blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio- 
iodinated goat anti-rabbit IgG. 

XIII. Purification of Naturally Occurring HSCOP Using Specific Antibodies 

Naturally occurring or recombinant HSCOP is substantially purified by 

5 immunoaffinity chromatography using antibodies specific for HSCOP, An 

immunoaffinity column is constructed by covalently coupling anti-HSOCH antibody to an 
activated chromatographic resin, such as CNBr-activated SEPH AROSE (Amersham 
Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the 
manufacturer's instructions. 

10 Media containing HSCOP are passed over the immunoaffinity column, and the 

column is washed under conditions that allow the preferential absorbance of HSCOP (e.g., 
high ionic strength buffers in the presence of detergent). The column is eluted under 
conditions that disrupt antibody/HSOCH binding (e.g., a buffer of pH 2 to pH 3, or a high 
concentration of a chaotrope, such as urea or thiocyanate ion), and HSCOP is collected. 

1 5 XIV. Identification of Molecules Wliich Interact with HSCOP 

HSCOP, or biologically active fragments thereof, are labeled with 
Bolton-Hunter reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate 
molecules previously arrayed in the wells of a multi-weli plate are incubated with the 
labeled HSCOP, washed, and any wells with labeled HSCOP complex are assayed. Data 

20 obtained using different concentrations of HSCOP are used to calculate values for the 
number, affinity, and association of HSCOP with the candidate molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without depeirting from the scope and 
spirit of the invention. Although the invention has been described in connection with 

25 specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 , A substantially purified polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-9, and fragments thereof, 
5 2. A substantially purified variant having at least 90% amino acid sequence 

identity to the amino acid sequence of claim 1 , 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 

1, 

4. An isolated and purified polynucleotide variant having at least 90% 
10 polynucleotide sequence identity to the polynucleotide of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to the polynucleotide of claim 3, 

6. An isolated and purified polynucleotide having a sequence w^hich is 
complementary to the polynucleotide of claim 3, 

15 7. A method for detecting a polynucleotide, the method comprising the steps 

of: 

(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid 
in a sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
20 hybridization complex correlates with the presence of the polynucleotide in the 

sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide 
prior to hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide 

25 sequence selected from the group consisting of SEQ ID NO: 10-1 8, and fragments thereof. 

10. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide sequence identity to the polynucleotide of claim 9. 

1 1 . An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 9. 

30 12. An expression vector comprising at least a fragment of the polynucleotide 

of claim 3. 
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13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of: 
a) culturing the host cell of claim 1 3 under conditions suitable for the 

expression of the polypeptide; and 
5 b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in 
conjunction with a suitable pharmaceutical carrier, 

16. A purified antibody w^hich specifically binds to the polypeptide of claim I . 

1 7. A purified agonist of the polypeptide of claim 1 . 

10 18. A purified antagonist of the polypeptide of claim L 

19. A method for treating or preventing a disorder associated with decreased 
expression or activity of HSCOP, the method comprising administering to a subject in 
need of such treatment an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 
15 expression or activity of HSCOP, the method comprising administering to a subject in 

need of such treatment an effective amount of the antagonist of claim 18. 



-69- 



wo 99/61614 



PCT/US99/11497 



SEQUENCE LISTING 



:110> INCYTE PHARMACEUTICALS, INC. 
LAL, Preeti 
HILLMAN, Jennifer L. 
GORGONE, Gina 
CORLEY, Neil C. 
PATTERSON , Chandra 
YUE , Henry 
TANG, y. Tom 
AZIMZAI , Yalda 

<120> HUMAN SOCS PROTEINS 
<130> PF-0525 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/087,104; 09/216,006 
<151> 1998-05-28; 1998-12-17 

<160> 18 

<17 0> PERL Program 

<210> 1 

<211> 288 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feat:ure 

<223> Incyte clone 1758450 



Ser Ser Ser Met Trp Tyr He Met Gin Ser He Gin Ser I.ys 

Tyi ser X,eu Ser Glu Arg l-eu He Arg Thr He Ala Ala He Arg 

ser Phe Pro His Asn Val Glu Asp Leu He Arg Gly Gly Ala 

ASP val Asn Cys His Gly Thr Leu Lys Pro Leu His Cys Ala 

cys Met val Ser Asp Ala Asp Cys Val Glu Leu Leu Leu Glu Lys 

Gly Ala Glu val Asn Ala Leu Asp Gly Tyr Asn Arg Thr Ala Leu 

His Tyr Ala Ala Glu Lys Asp Glu Ala Cys Val Glu Val Leu Leu 

Glu Tyr Gly Ala Asn Pro Asn Ala Leu Asp Gly Asn Arg Asp Thr 

110 ^-^^ 
pro Leu His Trp Ala Ala Phe Lys Asn Asn Ala Glu Cys Val Arg 

Ala Leu Leu Glu Ser Gly Ala Ser Val ^sn Ala Leu Asp Tyr Asn 



140 



1/19 



wo 99/61614 



PCT/US99/n497 



Asn Asp Thr 
Ser Val Ser 
Asn Leu He 
Val Arg Gly 
Leu His Arg 
Met Pro Arg 
Tiir Val Leu 
Tyr Ala Val 
Val Lys Gly 
Leu Leu Glu 



Pro Leu Ser 
155 

lie Leu Leu 

170 
Gly Gin Thr 

185 
Leu Gly Thr 

200 

Ala Val Gly 
215 

Glu Val Ala 
230 

Cys Ser Ala 
245 

Arg Arg Ser 

260 
Leu Pro Leu 

275 



Trp Ala Ala 
Asp Tyr Gly 
Pro He Ser 
Glu Lys Glu 
His Phe Glu 
Arg Asp Pro 
Pro Gly Thr 
Leu Gly Leu 
Pro Ala Ser 



Met Lys Gly 
160 

Ala Glu Val 
175 

Arg Leu Val 
190 

Asp Ser Cys 
205 

Leu Arg Lys 
220 

Gin Leu Cys 
235 

Leu Lys Thr 
250 

Gin Tyr Leu 
265 

Leu Lys Glu 
280 



Asn Leu Glu 
165 

Arg Val He 
180 

Ala Leu Leu 
195 

Phe Glu Leu 
210 

Asn Gly Thr 

225 

Glu Lys Leu 
240 

Leu Ala Arg 
255 

Pro Asp Ala 
270 

Tyr Leu Leu 
285 



<210> 2 

<211> 423 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte clone 1834242 



<400> 2 

Met Lys Leu 
1 

Arg Lys Gly 
Glu Leu Lys 
Cys Glu Thr 
Ala Trp Ser 
Leu Glu Glu 
Ser Ser Lys 
Thr Leu Asp 
Trp Pro Ser 
Gin Val Pro 
Asp Gly Gin 
Leu Asn Leu 



Thr Pro Arg 
5 

Lys Arg Ser 

20 

Pro Gly Arg 
35 

Trp Ser Val 
50 

Gin Gly His 
65 

Gin Phe He 
80 

Asn Glu Thr 
95 

Cys Gly Gin 
110 

Pro Pro Ser 
125 

Asp Val Ser 
140 

He Lys He 
155 

Ser Gly His 
170 



Thr Ala Gly 
Ser Trp Gly 
Pro His Gin 
Ala Phe Ser 
Cys He Val 
Pro Lys Gly 
Lys Gly Arg 
He Val Trp 
Arg Lys Leu 
Cys Leu Val 
Trp Glu Val 
Gin Asp Val 



Arg Ala Trp 
10 

Gly Thr Ala 
25 

Phe Asp Trp 
40 

Pro Asp Gly 
55 

Lys Leu He 
70 

Phe Glu Ala 
85 

Gly Ser Pro 
100 

Gly Leu Ala 
115 

Trp Ala Arg 
130 

Leu Ala Thr 
145 

Gin Thr Gly 
160 

Val Arg Asp 
175 



Ala Gin Ser 
15 

Ala Val Ala 
30 

Lys Ser Ser 
45 

Ser Trp Phe 
60 

Pro Trp Pro 
75 

Lys Ser Arg 
90 

Lys Glu Lys 
105 

Phe Ser Pro 

120 

His His Pro 
135 

Gly Leu Asn 
150 

Leu Leu Leu 
165 

Leu Ser Phe 
ISO 
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Thr Pro Ser Gly Ser Leu lie Leu Val Ser Ala Ser Arg Asp Lys 

185 190 195 

Thr Leu Arg He Trp Asp Leu Asn Lys His Gly Lys Gin He Glrx 

200 205 210 

Val Leu Ser Gly His Leu Gin Trp Val Tyr Cys Cys Ser He Ser 

215 220 225 

Pro Asp Cys Ser Met Leu Cys Ser Ala Ala Gly Glu Lys Ser Val 

230 235 240 

Phe Leu Trp Ser Met Arg Ser Tyr Thr Leu He Arg Lys Leu Glu 

245 250 255 

Gly His Gin Ser Ser Val Val Ser Cys Asp Phe Ser Pro Asp Ser 

260 265 270 

Ala Leu Leu Val Thr Ala Ser Tyr Asp Thr Asn Val He Met Trp 

275 280 285 

Asp Pro Tyr Thr Gly Glu Arg Leu Arg Ser Leu His His Thr Gin 

290 295 300 

Val Asp Pro Ala Met Asp Asp Ser Asp Val His He Ser Ser Leu 

305 310 315 

Arg Ser Val Cys Phe Ser Pro Glu Gly Leu Tyr Leu Ala Thr Val 

320 325 330 

Ala Asp Asp Arg Leu Leu Arg He Trp Ala Leu Glu Leu Lys Thr 

335 340 345 

Pro He Ala Phe Ala Pro Met Thr Asn Gly Leu Cys Cys Thr Phe 

350 355 360 

Phe Pro His Gly Gly Val He Ala Thr Gly Thr Arg Asp Gly His 

365 370 375 

Val Gin Phe Trp Thr Ala Pro Arg Val Leu Ser Ser Leu Lys His 

380 385 390 

Leu Cys Arg Lys Ala Leu Arg Ser Phe Leu Thr Thr Tyr Gin Val 

395 400 405 

Leu Ala Leu Pro He Pro Lys Lys Met Lys Glu Phe Leu Thr Tyr 

410 415 420 

Arg Thr Phe 



<210> 3 
<211> 349 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte clone 1849725 

<400> 3 

Met Glu Asp Pro Gin Ser Lys Glu Pro Ala Gly Glu Ala Val Ala 
15 10 15 

Pro Ala Leu Leu Glu Ser Pro Arg Pro Glu Gly Gly Glu Glu Pro 

20 25 30 

Pro Arg Pro Ser Pro Glu Glu Thr Gin Gin Cys Lys Phe Asp Gly 

35 40 45 

Gin Glu Thr Lys Gly Ser Lys Phe He Thr Ser Ser Ala Ser Asp 

50 55 60 

Phe Ser Asp Pro Val Tyr Lys Glu He Ala He Thr Asn Gly Cys 

65 70 75 

He Asn Arg Met Ser Lys Glu Glu Leu Arg Ala Lys Leu Ser Glu 
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80 85 90 

Phe Lys Leu Glu Thr Arg Gly Val Lys Asp Val Leu Lys Lys Arg 

95 100 105 

Leu Lys Asn Tyr Tyr Lys Lys Gin Lys Leu Met Leu Lys Glu Ser 

110 115 120 

Asn Phe Ala Asp Ser Tyr Tyr Asp Tyr He Cys He He Asp Phe 

125 130 135 

Glu Ala Thr Cys Glu Glu Gly Asn Pro Pro Glu Phe Val His Glu 

140 145 150 

He He Glu Phe Pro Val Val Leu Leu Asn Thr His Thr Leu Glu 

155 160 165 

He Glu Asp Thr Phe Gin Gin Tyr Val Arg Pro Glu He Asn Thr 

170 175 180 

Gin Leu Ser Asp Phe Cys He Ser Leu Thr Gly He Thr Gin Asp 

185 190 195 

Gin Val Asp Arg Ala Asp Thr Phe Pro Gin Val Leu Lys Lys Val 

200 205 210 

He Asp Trp Met Lys Leu Lys Glu Leu Gly Thr Lys Tyr Lys Tyr 

215 220 225 

Ser Leu Leu Thr Asp Gly Ser Trp Asp Met Ser Lys Phe Leu Asn 

230 235 240 

He Gin Cys Gin Leu Ser Arg Leu Lys Tyr Pro Pro Phe Ala Lys 

245 250 255 

Lys Trp He Asn He Arg Lys Ser Tyr Gly Asn Phe Tyr Lys Val 

260 265 270 

Pro Arg Ser Gin Thr Lys Leu Thr He Met Leu Glu Lys Leu Gly 

275 280 285 

Met Asp Tyr Asp Gly Arg Pro His Cys Gly Leu Asp Asp Ser Lys 

290 295 300 

Asn He Ala Arg He Ala Val Arg Met Leu Gin Asp Gly Cys Glu 

305 310 315 

Leu Arg He Asn Glu Lys Met His Ala Gly Gin Leu Met Ser Val 

320 325 330 

Ser Ser Ser Leu Pro He Glu Gly Thr Pro Pro Pro Gin Met Pro 

335 340 345 

His phe Arg Lys 



<210> 4 
<211> 355 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> incyte clone 2547840 

<400> 4 

Met Ala Arg Arg Pro Arg Asn Ser Arg Ala Trp His Phe Val Leu 
15 10 15 

Ser Ala Ala Arg Arg Asp Ala Asp Ala Arg Ala Val Ala Leu Ala 

20 25 30 

Gly Ser Thr Asn Trp Gly Tyr Asp Ser Asp Gly Gin His Ser Asp 

35 40 45 

Ser Asp Ser Asp Pro Glu Tyr Ser Thr Leu Pro Pro Ser He Pro 
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50 55 60 

Ser Ala Val Pro Val Thr Gly Glu Ser Phe Cys Asp Cys Ala Gly 

65 70 75 

Gin Ser Glu Ala Ser Phe Cys Ser Ser Leu His Ser Ala His Arg 

80 85 90 

Gly Arg Asp Cys Arg Cys Gly Glu Glu Asp Glu Tyr Phe Asp Trp 

95 100 105 

Val Trp Asp Asp Leu Asn Lys Ser Ser Ala Thr Leu Leu Ser Cys 

110 115 120 

Asp Asn Arg Lys Val Ser Phe His Met Glu Tyr Ser Cys Gly Thr 

125 130 135 

Ala Ala lie Arg Gly Thr Lys Glu Leu Gly Glu Gly Gin His Phe 

140 145 150 

Trp Glu He Lys Met Thr Ser Pro Val Tyr Gly Thr Asp Met Met 

155 160 165 

Val Gly He Gly Thr Ser Asp Val Asp Leu Asp Lys Tyr Arg His 

170 175 180 

Thr Phe Cys Ser Leu Leu Gly Arg Asp Glu Asp Ser Trp Gly Leu 

185 190 195 

Ser Tyr Thr Gly Leu Leu His His Lys Gly Asp Lys Thr Ser Phe 

200 205 210 

Ser Ser Arg Phe Gly Gin Gly Ser He He Gly Val His Leu Asp 

215 220 225 

Thr Trp His Gly Thr Leu Thr Phe Phe Lys Asn Arg Lys Cys He 

230 235 240 

Gly Val Ala Ala Thr Lys Leu Gin Asn Lys Arg Phe Tyr Pro Met 

245 250 255 

Val Cys Ser Thr Ala Ala Arg Ser Ser Met Lys Val Thr Arg Ser 

260 265 270 

Cys Ala Ser Ala Thr Ser Leu Gin Tyr Leu Cys Cys His Arg Leu 

275 280 285 

Arg Gin Leu Arg Pro Asp Ser Gly Asp Thr Leu Glu Gly Leu Pro 

290 295 300 

Leu Pro Pro Gly Leu Lys Gin Val Leu His Asn Lys Leu Gly Trp 

305 310 315 

Val Leu Ser Met Ser Cys Ser Arg Arg Lys Ala Pro Val Ser Asp 

320 325 330 

Pro Gin Ala Ala Thr Ser Ala His Pro Ser Ser Arg Glu Pro Arg 

335 340 345 

Pro Cys Gin Arg Lys Arg Cys Arg Arg Thr 

350 355 



<210> 5 
<211> 421 
<212> PRT 

<213> Homo sapiens 
<220> 

<22i> misc_f eature 

<223> Incyte clone 3071986 

<400> 5 

Met Ala Ser Phe Pro Pro Arg Val Asn Glu Lys Glu He Val Arg 
15 10 15 
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Leu Arg TJir lie Gly Glu Leu Leu Ala Pro Ala Ala Pro Phe Asp 

20 25 30 

Lys Lys Cys Gly Arg Glu Asn Trp Thr Val Ala Phe Ala Pro Asp 

35 40 45 

Gly Ser Tyr Phe Ala Trp Ser Gin Gly His Arg Thr Val Lys Leu 

50 55 60 

Val Pro Trp Ser Gin Cys Leu Gin Asn Phe Leu Leu His Gly Thr 

65 70 75 

Lys Asn Val Thr Asn Ser Ser Ser Leu Arg Leu Pro Arg Gin Asn 

80 85 90 

Ser Asp Gly Gly Gin Lys Asn Lys Pro Arg Glu His lie lie Asp 

95 100 105 

Cys Gly Asp lie Val Trp Ser Leu Ala Phe Gly Ser Ser Val Pro 

110 X15 120 

Glu Lys Gin Ser Arg Cys Val Asn lie Glu Trp His Arg Phe Arg 

125 130 135 

Phe Gly Gin Asp Gin Leu Leu Leu Ala Thr Gly Leu Asn Asn Gly 

140 145 150 

Arg lie Lys lie Trp Asp Val Tyr Thr Gly Lys Leu Leu Leu Asn 

155 160 165 

Leu Val Asp His Thr Glu Val Val Arg Asp Leu Thr Phe Ala Pro 

170 175 180 

Asp Gly Ser Leu lie Leu Val Ser Ala Ser Arg Asp Lys Thr Leu 

185 190 195 

Arg Val Trp Asp Leu Lys Asp Asp Gly Asn Met Met Lys Val Leu 

200 205 210 

Arg Gly His Gin Asn Trp Val Tyr Ser Cys Ala Phe Ser Pro Asp 

215 220 225 

Ser Ser Met Leu Cys Ser Val Gly Ala Ser Lys Ala Val Phe Leu 

230 235 240 

Trp Asn Met Asp Lys Tyr Thr Met lie Arg Lys Leu Glu Gly His 

245 250 255 

His His Asp Val Val Ala Cys Asp Phe Ser Pro Asp Gly Ala Leu 

260 265 270 

Leu Ala Thr Ala Ser Tyr Asp Thr Arg Val Tyr lie Trp Asp Pro 

275 280 285 

His Asn Gly Asp lie Leu Met Glu Phe Gly His Leu Phe Pro Pro 

290 295 300 

Pro Thr Pro lie Phe Ala Gly Gly Ala Asn Asp Arg Trp Val Arg 

305 310 315 

Ser Val Ser Phe Ser His Asp Gly Leu His Val Ala Ser Leu Ala 

320 325 330 

Asp Asp Lys Met Val Arg Phe Trp Arg lie Asp Glu Asp Tyr Pro 

335 340 345 

Val Gin Val Ala Pro Leu Ser Asn Gly Leu Cys Cys Ala Phe Ser 

350 355 360 

Thr Asp Gly Ser Val Leu Ala Ala Gly Thr His Asp Gly Ser Val 

365 370 375 

Tyr Phe Trp Ala Thr Pro Arg Gin Val Pro Ser Leu Gin His Leu 

360 385 390 

Cys Arg Met Ser He Arg Arg Val Met Pro Thr Gin Glu Val Gin 

395 400 405 

Glu Leu Pro He Pro Ser Lys Leu Leu Glu Phe Leu Ser Tyr Arg 

410 415 420 

He 
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<210> 6 
<211> 278 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> Tnisc_f eature 

<223> Incyte clone 3434619 



<400> 6 
Met Glu Pro 
1 

Phe Trp Val 
Glu Ser Leu 
Asn Gin Val 
Leu Gin Gly 
Ala Gin Val 
Asp Ala Cys 
Ser Tyr Gly 
Leu His Glu 
Leu lie Asp 
Gly Thr Pro 
Val Lys Val 
Leu His Glu 
Asp Leu lie 
Arg Asp Asn 
Ser Ala Pro 
Thr Leu Ser 
Val Arg Gly 
Leu lie Asp 



Arg Ala Ala 
5 

Glu Arg Thr 
20 

Gin Leu Gin 
35 

Thr Val Asp 
50 

Gin Ala Arg 
65 

Asp Ala Arg 
80 

Ala Ser Gly 
95 

Ala Lys Val 

110 
Ala Cys Met 

125 
Val Gly Ala 

140 
Leu His Val 

155 
Leu Leu Asn 

170 
Thr Ala Leu 

185 
Glu Met Leu 

200 
Arg Gly Lys 

215 
Ala Lys Cys 

230 
Gin Leu Cys 

245 
Leu Glu Lys 

260 
Tyr Leu Ser 

275 



Asp Gly Cys 
Pro Val His 
Gin Leu lie 
Ser lie Thr 
Cys Val Gin 
Asn lie Asp 
Ser lie Glu 
Asn Pro Pro 
Ser Gly Ser 
Asn Leu Glu 
Ala Cys Ala 
Ala Gly Ala 
His His Ala 
lie Glu Phe 
Lys Pro Ser 
Phe Glu Tyr 
Arg Val Asn 
lie Ala Lys 
Tyr Asn 



Phe Leu Gly 
10 

Glu Ala Ala 
25 

Glu Ser Gly 
40 

Pro Leu His 
55 

Leu Leu Leu 
70 

Gly Ser Thr 
85 

Cys Val Lys 
100 

Leu Tyr Thr 
115 

Ser Glu Cys 
130 

Ala His Asp 
14 5 

Arg Glu His 
160 

Asn Val Asn 
175 

Ala Lys Val 
190 

Gly Gly Asn 
205 

Asp Tyr Thr 
220 

Tyr Glu Lys 
235 

Leu Arg Lys 
250 

Leu Asn lie 
265 



Asp Val Gly 
15 

Gin Arg Gly 
30 

Ala Cys Val 
45 

Ala Ala Ser 
60 

Ala Ala Gly 
75 

Pro Leu Cys 
90 

Leu Leu Leu 
105 

Ala Ser Pro 
120 

Val Arg Leu 
135 

Cys His Phe 
150 

Leu Asp Cys 
165 

Ala Ala Lys 

180 

Lys Asn Val 
195 

lie Tyr Ala 
210 

Trp Ser Ser 
225 

Thr Pro Leu 
240 

Ala Thr Gly 
255 

Pro Pro Arg 
270 



<210> 7 
<211> 281 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte clone 1275743 

<400> 7 



Met 


Gly 


Ser 


Gin 


Gly 


Ser 


Pro 


Val 


Lys 


Ser 


Tyr 


Asp 


Tyr 


Leu 


Leu 


1 








5 










10 










15 


Lys 


Phe 


Leu 


Leu 


Val 


Gly Asp 


Ser 


Asp 


Val 


Gly 


Lys 


Gly 


Glu 


He 










20 










25 










30 


Leu 


Glu 


Ser 


Leu 


Gin 


Asp 


Gly Ala 


A.1 a 


Glu 


Ser 


Pro 


Tyr 


Ala 


Tyr 










35 










40 










45 


Ser 


Asn 


Gly 


lie 


Asp 
50 


Tyr 


Lys 


Thr 


Thr 


Thr 
55 


He 


Leu 


Leu 


Asp 


Gly 
60 


Arg 


Arg 


Val 


Lys 


Leu 


Glu 


Leu 


Trp 


Asp 


Thr 


Ser 


Gly 


Gin 


Gly Arg 










65 










70 










75 


Phe 


Cys 


Thr 


lie 


Phe 


Arg 


Ser 


Tyr 


Ser 


Arg 


Gly Ala 


Gin 


Gly 


He 










80 










85 










90 


Len 


Leu 


Val 


Tyr 


Asp 
95 


He 


Thr 


Asn 


Arg 


Trp 
100 


Ser 


Phe 


Asp 


Gly 


He 
105 


Asp 


Arg 


Trp 


He 


Lys 
110 


Glu 


He 


Asp 


Glu 


His 
115 


Ala 


Pro 


Gly 


Val 


Pro 
120 


Arg 


lie 


Leu 


Val 


Gly Asn Arg 


Leu 


His 


Leu 


Awl a 


Phe 


Lys 


Arg 


Gin 










125 










130 










135 


Val 


Pro 


Thr 


Glu 


Gin 
140 


Ala 


Arg 


Ala 


Tyr 


Ala 
145 


Glu 


Lys 


Asn 


Cys 


Met 
150 


Thr 


Phe 


Phe 


Glu 


Val 
155 


Ser 


Pro 


Leu 


Cys 


Asn 
160 


Phe 


Asn 


Val 


He 


Glu 
165 


Ser 


Phe 


Thr 


Glu 


Leu 


Ser 


Arg 


He 


Val 


Leu 


Met 


Arg 


His 


Gly Met 










170 










175 










180 


Glu 


Lys 


lie 


Trp 


Arg 
185 


Pro 


Asn 


Arg 


Val 


Phe 
190 


Ser 


Leu 


Gin 


Asp 


Leu 
195 


Cys 


Cys 


Arg 


Ala 


He 
200 


Val 


Ser 


Cys 


Thr 


Pro 
205 


Val 


His 


Leu 


He 


Asp 
210 


Lys 


Leu 


Pro 


Leu 


Pro 
215 


Val 


Thr 


He 


Lys 


Ser 
220 


His 


Leu 


Lys 


Ser 


Phe 
225 


Ser 


Met 


Ala 


Asn 


Gly Met Asn Ala Val 


Met 


Met 


His 


Gly 


Arg 


Ser 










230 










235 










240 


Tyr 


Ser 


Leu 


Ala 


Ser 


Gly Ala 


Gly Gly 


Gly 


Gly 


Ser 


Lys 


Gly Asn 










245 










250 










255 


Ser 


Leu 


Lys 


Arg 


Ser 
260 


Lys 


Ser 


He 


Arg 


Pro 
265 


Pro 


Gin 


Ser 


Pro 


Pro 
270 


Gin 


Asn 


Cys 


Ser 


Arg 

275 


Ser 


Asn 


Cys 


Lys 


He 
280 


Ser 











<210> 8 
<211> 635 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> mis Cofeature 

<223> Incyte clone 1722533 

<400> 8 

Met Ala Thr Gin He Ser Thr Arg Gly Ser Gin Cys Thr He Gly 
15 10 15 



8/19 



wo 99/61614 



PCT/US99/I1497 



Gin Glu Glu Tyr Ser Leu Tyr Ser Ser Leu Ser Giu Asp Glu Leu 

20 25 30 

Val Gin Met Ala lie Glu Gin Ser Leu Ala Asp Lys Thr Arg Gly 

35 40 45 

Pro Thr Thr Ala Glu Ala Thr Ala Ser Ala Cys Thr Asn Arg Gin 

50 55 60 

Pro Ala His Phe Tyr Pro Trp Thr Arg Ser Thr Ala Pro Pro Glu 

65 70 75 

Ser Ser Pro Ala Arg Ala Pro Met Gly Leu Phe Gin Gly Val Met 

80 85 90 

Gin Lys Tyr Ser Ser Ser Leu Phe Lys Thr Ser Gin Leu Ala Pro 

95 100 105 

Ala Asp Pro Leu lie Lys Ala lie Lys Asp Gly Asp Glu Glu Ala 

110 115 120 

Leu Lys Thr Met lie Lys Glu Gly Lys Asn Leu Ala Glu Pro Asn 

125 130 135 

Lys Glu Gly Trp Leu Pro Leu His Glu Ala Ala Tyr Tyr Gly Gin 

140 145 150 

Val Gly Cys Leu Lys Val Leu Gin Arg Ala Tyr Pro Gly Thr lie 

155 160 165 

Asp Gin Arg Thr Leu Gin Glu Glu Thr Ala Val Tyr Leu Ala Thr 

170 175 180 

Cys Arg Gly His Leu Asp Cys Leu Leu Ser Leu Leu Gin Ala Gly 

185 190 195 

Ala Glu Pro Asp He Ser Asn Lys Ser Arg Glu Thr Pro Leu Tyr 

200 205 210 

Lys Ala Cys Glu Arg Lys Asn Ala Glu Ala Val Lys lie Leu Val 

215 220 225 

Gin His Asn Ala Asp Thr Asn His Arg Cys Asn Arg Gly Trp Thr 

230 235 240 

Ala Leu His Glu Ser Val Ser Arg Asn Asp Leu Glu Val Met Gin 

245 250 255 

He Leu Val Ser Gly Gly Ala Lys Val Glu Ser Lys Asn Ala Tyr 

260 265 270 

Gly He Thr Pro Leu Phe Val Ala Ala Gin Ser Gly Gin Leu Glu 

275 280 285 

Ala Leu Arg Phe Leu Ala Lys Tyr Gly Ala Asp He Asn Thr Gin 

290 295 300 

Ala Ser Asp Asn Ala Ser Ala Leu Tyr Glu Ala Cys Lys Asn Glu 

305 310 315 

His Glu Glu Val Val Glu Phe Leu Leu Ser Gin Gly Ala Asp Ala 

320 325 330 

Asn Lys Thr Asn Lys Asp Gly Leu Leu Pro Leu His He Ala Ser 

335 340 345 

Lys Lys Gly Asn Tyr Arg He Val Gin Met Leu Leu Pro Val Thr 

350 355 360 

Ser Arg Thr Arg He Arg Arg Ser Gly Val Ser Pro Leu His Leu 

365 370 375 

Ala Ala Glu Arg Asn His Asp Glu Val Leu Glu Ala Leu Leu Ser 

380 385 390 

Ala Arg Phe Asp Val Asn Thr Pro Leu Ala Pro Glu Arg Ala Arg 

395 400 405 

Leu Tyr Glu Asp Arg Arg Thr Ser Ala Leu Tyr Phe Ala Val Val 

410 415 420 

Asn Asn Asn Val Tyr Ala Thr Glu Leu Leu Leu Gin His Gly Ala 

425 430 435 

Asp Pro Asn Arg Asp Val He Ser Pro Leu Leu Val Ala He Arg 
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440 445 450 

His Gly Cys Leu Arg Thr Met Gin Leu Leu Leu Asp His Gly Ala 

455 460 465 

Asn He Asp Ala Tyr He Ala Thr His Pro Thr Ala Phe Pro Ala 

470 475 430 

Thr He Met Phe Ala Met Lys Cys Leu Ser Leu Leu Lys Phe Leu 

485 490 495 

Met Asp Leu Gly Cys Asp Gly Glu Pro Cys Phe Ser Cys Leu Tyr 

500 505 510 

Gly Asn Gly Pro His Pro Pro Ala Pro Gin Pro Ser Ser Arg Phe 

515 520 525 

Asn Asp Ala Pro Ala Ala Asp Lys Glu Pro Ser Val Val Gin Phe 

530 535 540 

Cys Glu Phe Val Ser Ala Pro Glu Val Ser Arg Trp Ala Gly Pro 

545 550 555 

He He Asp Val Leu Leu Asp Tyr Val Gly Asn Val Gin Leu Cys 

560 565 570 

Ser Arg Leu Lys Glu His He Asp Ser Phe Glu Asp Trp Ala Val 

575 580 585 

He Lys Glu Lys Ala Glu Pro Pro Arg Pro Leu Ala His Leu Cys 

590 595 600 

Arg Leu Arg Val Arg Lys Ala He Gly Lys Tyr Arg He Lys Leu 

605 610 615 

Leu Asp Thr Leu Pro Leu Pro Gly Arg Leu He Arg Tyr Leu Lys 

620 625 630 

Tyr Glu Asn Thr Gin 

635 



<210> 9 
<211> 518 
<2X2> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Xncyte clone 1759763 

<400> 9 

Met Asp Phe Thr Glu Ala Tyr Ala Asp Thr Cys Ser Thr Val Gly 

15 10 15 

Leu Ala Ala Arg Glu Gly Asn Val Lys Val Leu Arg Lys Leu Leu 

20 25 30 

Lys Lys Gly Arg Ser Val Asp Val Ala Asp Asn Arg Gly Trp Met 

35 40 45 

Pro He His Glu Ala Ala Tyr His Asn Ser Val Glu Cys Leu Gin 

50 55 60 

Met Leu He Asn Ala Asp Ser Ser Glu Asn Tyr He Lys Met Lys 

65 70 75 

Thr Phe Glu Gly Phe Cys Ala Leu His Leu Ala Ala Ser Gin Gly 

80 85 90 

His Trp Lys He Val Gin He Leu Leu Glu Ala Gly Ala Asp Pro 

95 100 105 

Asn Ala Thr Thr Leu Glu Glu Thr Thr Pro Leu Phe Leu Ala Val 

110 115 120 
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Glu Asn Gly Gin lie Asp Val Leu Arg Leu Leu Leu Gin His Gly 

125 130 135 

Ala Asn Val Asn Gly Ser His Ser Met Cys Gly Trp Asn Ser Leu 

140 145 150 

His Gin Ala Ser Phe Gin Glu Asn Ala Glu lie lie Lys Leu Leu 

155 160 165 

Leu Arg Lys Gly Ala Asn Lys Glu Cys Gin Asp Asp Phe Gly lie 

170 175 180 

Thr Pro Leu Phe Val Ala Ala Gin Tyr Gly Lys Leu Glu Ser Leu 

185 190 195 

Ser He Leu He Ser Ser Gly Ala Asn Val Asn Cys Gin Ala Leu 

200 205 210 

Asp Lys Ala Thr Pro Leu Phe He Ala Ala Gin Glu Gly His Thr 

215 220 225 

Lys Cys Val Glu Leu Leu Leu Ser Ser Gly Ala Asp Pro Asp Leu 

230 235 240 

Tyr Cys Asn Glu Asp Ser Trp Gin Leu Pro He His Ala Ala Ala 

245 250 255 

Gin Met Gly His Thr Lys He Leu Asp Leu Leu He Pro Leu Thr 

260 265 270 

Asn Arg Ala Cys Asp Thr Gly Leu Asn Lys Val Ser Pro Val Tyr 

275 280 285 

Ser Ala Val Phe Gly Gly His Glu Asp Cys Leu Glu He Leu Leu 

290 295 300 

Arg Asn Gly Tyr Ser Pro Asp Ala Gin Ala Cys Leu Val Phe Gly 

305 310 315 

Phe Ser Ser Pro Val Cys Met Ala Phe Gin Lys Asp Cys Glu Phe 

320 325 330 

Phe Gly He Val Asn He Leu Leu Lys Tyr Gly Ala Gin He Asn 

335 340 345 

Glu Leu His Leu Ala Tyr Cys Leu Lys Tyr Glu Lys Phe Ser He 

350 355 360 

Phe Arg Tyr Phe Leu Arg Lys Gly Cys Ser Leu Gly Pro Trp Asn 

365 370 375 

His He Tyr Glu Phe Val Asn His Ala He Lys Ala Gin Ala Lys 

380 385 390 

Tyr Lys Glu Trp Leu Pro His Leu Leu Val Ala Gly Phe Asp Pro 

395 400 405 

Leu He Leu Leu Cys Asn Ser Trp He Asp Ser Val Ser He Asp 

410 415 420 

Thr Leu He Phe Thr Leu Glu Phe Thr Asn Trp Lys Thr Leu Ala 

425 430 435 

Pro Ala Val Glu Arg Met Leu Ser Ala Arg Ala Ser Asn Ala Trp 

440 445 450 

He Leu Gin Gin His He Ala Thr Val Pro Ser Leu Thr His Leu 

455 460 465 

Cys Arg Leu Glu He Arg Ser Ser Leu Lys Ser Glu Arg Leu Arg 

470 475 480 

Ser Asp Ser Tyr He Ser Gin Leu Pro Leu Pro Arg Ser Leu His 

485 490 495 

Asn Tyr Leu Leu Tyr Glu Asp Val Leu Arg Met Tyr Glu Val Pro 

500 505 510 

Glu Leu Ala Ala He Gin Asp Gly 

515 
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<210> 10 

<211> 1117 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte clone 1758450 



<400> 10 

cacgccttga cagcggcttt caacccccac ctcagcccag caattcggca gtttggagca 60 
tgtgaacacc ttgagccttg atgagttcca gtatgtggta tattatgcag agcattcaga 120 
gcaaatactc tctctccgag cgcttaatcc gaacaattgc tgccatccgt tccttcccac 180 
atgataatgt agaggacctc atcagagggg gagcagatgt gaactgcact catggcacac 240 
tgaagccctt gcactgtgcc tgtatggtgt cagatgctga ctgtgtggag ttacttctgg 300 
aaaaaggagc cgaggtgaat gccctggatg ggtataaccg aacagccctc cactatgcag 360 
cagagaaaga tgaggcttgt gtggaggtcc tattggagta tggtgcaaac cccaatgctt 420 
tggatggcaa cagagatacc ccacttcact gggcagcctt taagaacaat gctgagtgtg 480 
tgcgggctct cctagagagc ggggcctctg tcaatgccct ggattacaac aatgatacac 540 
cgctcagctg ggctgccatg aagggaaatc ttgagagtgt cagcatcctt ctggattatg 600 
gcgcagaggt cagagtcatc aacctaatag gccagacacc catctcccgc ctggtggctc 660 
tgctagtcag gggacttgga acagagaaag aggactcttg ctttgagctc ctccacagag 720 
ctgttggaca ctttgaattg aggaaaaatg gcaccatgcc acgagaggtg gccagagacc 780 
cgcagctatg tgaaaaactg actgttctgt gctcagctcc aggaactcta aaaacactcg 840 
ctcgctatgc cgtgcgccgt agcctgggac tccagtafcct ccccgatgca gtgaagggcc 900 
ttccactgcc agcttctttg aaggaatacc tgttactttt agaatagccg gagaagatgt 960 
ttgcaccatc gtgcaggcag ctctgggtga ggttgtccct gcagtactcc ttgtcacaga 1020 
aaacagaaaa acagttgttt cctgatgtgt gggttataga tttcgaagca acatgtcaca 1080 
acaataacct gcatagcaac ticccctttcc aaacaaa 1117 



<210> 11 

<211> 2589 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte clone 1834242 



<400> 11 

cttgaatgaa gctgacacca agaaccgcgg gaagagcttg ggcccaaagc aggaaaggga 60 
agcgctcgag ttggggagga accgctgctg tggccgaact caagcccggg cgcccccacc 12 0 
agtttgattg gaagtccagc tgtgaaacct ggagcgtcgc cttctcccca gatggctcct 18 0 
ggtttgcttg gtctcaagga cactgcatcg tcaaactgat cccctggccg ttggaggagc 240 
agttcatccc taaagggttt gaagccaaaa gccgaagtag caaaaatgag acgaaagggc 3 00 
ggg^cagccc aaaagagaag acgctggact gtggtcagat tgtctggggg ctggccttca 360 
gcccgtggcc ttccccaccc agcaggaagc tctgggcacg ccaccacccc caagtgcccg 42 0 
atgtctcttg cctggttctt gctacgggac tcaacgatgg gcagatcaag atctgggagg 480 
tgcagacagg gctcctgctt ttgaatcttt ccggccacca agatgtcgtg agagatctga 540 
gcttcacacc cagtggcagt ttgattttgg tctccgcgtc acgggataag actcttcgca 600 
tctgggacct gaataaacac ggtaaacaga ttcaagtgtt atcgggccac ctgcagtggg 66 0 
tttactgctg ttccatctcc ccagactgca gcatgctgtg ctctgcagct ggagagaagt 720 
cggtctt-tct atggagcatg aggtcctaca cgttaattcg gaagctagag ggccatcaaa 780 
gcagtgttgt ctcttgtgac ttctcccccg actctgccct gcttgtcacg gcttcttacg 84 0 
ataccaatgt gattatgtgg gacccctaca ccggcgaaag gctgaggtca ctccaccaca 900 
cccaggttga ccccgccatg gatgacagtg acgtccacat tagctcactg agatctgtgt 96 0 
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gcttctctcc ggaaggcttg taccttgcca cggtggcaga tgacagactc ctcaggatct 102 0 
gggccctgga actgaaaact cccattgcat ttgctcctat gaccaatggg ctttgctgca 1080 
cattttttcc acatggtgga gtcattgcca cagggacaag agatggccac gfcccagttct 1140 
ggacagctcc tagggtcctg tcctcactga agcacttatg ccggaaagcc cttcgaagtt 1200 
tcctaacaac ttaccaagtc ctagcactgc caatccccaa gaaaatgaaa gagttcctca 1260 
catacaggac tttttaagca acaccacatc ttgtgcttct ttgtagcagg gtaaatcgtc 132 0 
ctgtcaaagg gagttgctgg aataatgggc caaacatctg gtcttgcatt gaaatagcat 138 0 
ttctttggga ttgtgaatag aatgtagcaa aaccagattc cagtgtacta gtcatggatc 1440 
tttctctccc tggcatgtga aagtcagtct tagaggaaga gattccactt gcacggcaac 1500 
agagccttac gttaaatctt cagtccagtt atgaacagca agtgttgaac tctttctgct 1560 
tgttttgatt caaagtgcag ttactgatgt tgttttgatt atgcaactaa gtaggcctcc 1620 
agagcctctc tagtggcaga gcagctcaca ctccctccgc tgggaacgat ggcttctgcc 1680 
tagtaccfcat ccttgtgttt ctgatgcagt ggtagcattg gttcaagttc tctcctgctg 174 0 
tggtcagagt tgcttcgatg ttggccaagt gcttttcttc ttgggctccc ttctgacctg 1800 
caggacagtt ttcctggagc catttggtat gaggtattaa tttagcttaa ctaaattaca 1860 
ggggactcag aggccgtgct cctgaccgat ccagacacta ttactggctt tttttttttt 1920 
tttttaacaa tggtgtgcat gtgcaggaaa tgacaaattt gtatgtcaga ttatacaagg 1980 
atgtattctt aaaccgcatg actattcaga tggctactga gttatcagtg gccatttatt 2 04 0 
agcatcatat ttatttgtat tttctcaaca gatgttaagg tacaactgtg tttttctcga 2100 
ttatctaaaa accatagtac ttaaattgaa cagttgcaaa gatgtcttaa ttgtgtaaag 216 0 
aattggtgta gtcatgactt tagctgatac tcttatgtac gagatctgtc tctgctgttt 222 0 
aacttcattg gattaatcag ctggtttcaa ctctactgcg aaacaaaaat agctccttaa 2280 
aagtactgtt ctccttcagt ggcatgtagt tatctaatca agacacctca ttcaaacaaa 234 0 
acctgcctta ggaaaattta atatatttta aattatttta aaagaaatac aacatcttat 2400 
tctttagctt tcttaatcgg tgctttatgg aggccagtgt aacgttacat gactcgttga 2460 
gaaagttgag gaatttcctc taccaccttt gttgcttgaa gaaaaacatg tcttttcaaa 252 0 
atgagaggct ttcattgaag aaaagaaaaa aacaacagtt aaaagctaaa aaaaaaaaaa 2580 
aaaaaaaaa 258 9 



<210> 12 

<211> 2038 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte clone 1849725 

<400> 12 

cgggaacgcg agcccggtaa tttttcaacg gagaaaggcg aggctttcgg gctctgcaga 6 0 
gtgagagtta gcaagtgtcc ggctccagca actctcctct ggcgtgacag ccggcatgga 12 0 
ggatccacag agtaaagagc ctgccggcga ggccgtggct cccgcgctgc tggagtcgcc 18 0 
gcggccggag ggcggggagg agccgccgcg tcccagtccc gaggaaactc aacagtgtaa 240 
atttgatggc caggagacaa aaggatccaa gttcattacc tccagtgcga gtgacttcag 300 
tgacccggtt tacaaagaga ttgccattac gaatggctgt attaatagaa tgagtaagga 360 
agaactcaga gctaagcttt cagaattcaa gcttgaaact agaggagtaa aggatgttct 42 0 
aaagaagaga ctgaaaaact attataagaa gcagaagctg atgctgaaag agagcaattt 480 
tgctgacagt tattatgact acatttgtat tattgacttt gaagccactt gtgaagaagg 540 
aaacccacct gagtttgtac atgaaataat tgaatttccg gttgttttac tgaatacgca 600 
tactttagaa atagaagaca cgtttcagca gtatgtaaga ccagagatta acacacagct 660 
gtctgatttc tgcatcagtc taactggaat tactcaggat caggtagaca gagctgatac 72 0 
cttccctcag gtactaaaaa aagtaattga ctggatgaaa ttgaaggaat taggaacaaa 78 0 
gtataaatac tcacttttaa cagatggttc ttgggatatg agtaagttct tgaacattca 840 
gtgtcaactc agcaggctca aataccctcc ttttgcgaaa aagtggatca atattcggaa 900 
gtcatatgga aatttttaca aggttcctag aagccaaacc aaactgacaa taatgcttga 960 
aaaattagga atggattatg atgggcggcc tcactgtggt cttgatgact ctaagaatat 102 0 
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cgcccgaata gcagttcgaa tgcttcagga tgggtgtgaa ctccgaatca acgagaaaat 1080 

gcatgcagga cagctaatga gtgtgtcctc ttccttacca atagagggca ctccaccacc 1140 

acaaatgcca cattttagaa agtaacagtt ttgtgtgtgg atcattccaa ttgaagttgc 1200 

tatgaagagg tagcagatga atctcattga attagtcctg tagtgcaaac tttaagcacc 1260 

ttaaaacatt taaaatctta ttacaggtga tagagataga tacatgtatg tgaacagatt 1320 

ttgtaggaag gcatactgaa ttctttgtca ccaagcactt ttgataatgg acaggaatcc 1380 

ggtaacctag ataaccaagg tcctggtcta acacaatggg atattttaat aattttaaag 1440 

aggsggttcc acaggttata aattcccttt ttttgggtgt ttaaaaaaat ggcccaaaaa 1500 

tctcctaaat atggggcctt ggtgtctctc ggtttggaaa atgggccaac aaatcccttt 1560 

aactcggggg tggtggttta acaaataaat gggtagaaat ggggtggggt tttccccttt 162 0 

taaatttaaa accatttcca ccttaaggga ttggtaaaca ccccccttaa atccccttta 16 80 

aaaaaattgg tcccgggaaa aattgggatt tgggggcaaa agggtaagga attcctgtaa 1740 

tccctaaagg cctctctttg ggggaatttt tccccagggg gaatatatcc ccttaaggtg 1800 

cccccctttt gtggaatttt tttccccaaa aggggtttat aataaatgtt gggaaaagtt 1860 

ttccaccccc aaggggaaat gggtgggggt gggaaaattt tccggtaaaa gaggtgacac 1920 

tttggggtag atgacccata aatacttgcg cctcaagggg gtttgcccct attttcaaaa 1980 

aactccccta aaaatttggg gggaggagaa ttttatttgg attagggggg tttatata 2038 



<210> 13 

<211> 1537 

<212> DHA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte clone 2547840 

<400> 13 

tgggtcggac gcgtgggcgg aaagtgggtc agggccgggc cggcggagcg cgcagcgggg 6 0 
cagccagatt ctttccacca tggccagacg cccccggaac agcagggcct ggcacttcgt 120 
cctgagtgca gcccgccgag acgcagatgc ccgggccgtg gctctagcag gctccactaa 180 
ctggggctac gactctgatg ggcagcacag cgactcggac tccgaccccg agtactccac 24 0 
gctgccgcca tccatcccca gtgcggtgcc cgfcgaccggc gagtccttct gtgactgtgc 3 00 
tgggcagagc gaggcctcct tctgtagcag cctgcactcg gcccaccggg gcagggactg 360 
ccgctgcgga gaggaagacg agtatttcga ctgggtctgg gatgacttaa ataagtcatc 42 0 
agccaccctg ctgagctgtg acaaccgtaa ggtcagcttc cacatggagt acagctgcgg 480 
cacagcggcc atccggggca ccaaggagct gggggagggc cagcacttct gggagatcaa 54 0 
gatgacctct cccgtctacg gcaccgacat gatggtgggc atcgggacgt cggatgtgga 600 
cctggacaaa taccgccaca cgttctgcag cctgctgggc agggatgagg acagctgggg 66 0 
cctctcctac acgggcctcc tccaccacaa gggcgacaag accagcttct cgtcgcggtt 720 
cggccagggc tccatcattg gcgtgcacct ggacacctgg cacggcacac tcaccttttt 780 
caagaacagg aagtgtatag gtgtggcagc caccaagctg cagaacaaga gattctaccc 840 
gatggtgtgc tccacggcgg cccggagcag catgaaggtc acccgctcct gtgccagcgc 900 
cacttccctc cagtacctgt gctgccaccg cctgcgccag ctgcggccag actcgggaga 960 
cacgctggag ggtctgccgc tgccgccggg ccticaagcag gtgctacaca acaagctggg 1020 
ctgggtcctg agcatgagtt gcagccgccg caaggctcca gtgtccgatc cccaggcagc 1080 
gacctccgcc caccccagca gtcgcgagcc tcggccctgc cagaggaagc gctgccgccg 1140 
gacctgactg acttcccagt ggaactgcct tcttgggctg ggacagcccc tttcctctgt 1200 
cccttctttc tctgtccctt ccttccagcc acactccagg gcggagttgg atgaggcccg 1260 
tccggaggga gccatctctt gctcccgagg ctgggacagt cctttctgtg ggggctctag 1320 
ggcccctctg ctgctgtgct gggtggggaa gcggctgccc tgagccccag gtcttgtggg 1380 
aggctgcgag gacgagagcc tiggctggagc ccgcgttgct gttcccacag ggcctcggtt 1440 
tttcctaact tgctctgcat gctgtcagcg gctgccccgc cgtcatagac ttaaaggact 1500 
gcaataaatg tagagttgat gtctaacaaa aaaaaaa 1537 
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<210> 14 

<211> 2203 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte clone 3071986 

<400> 14 

ctgtcttcct ccgcagcgcg aggctgggta cagggtctat tgtctgtggt tgactccgta 60 

ctttggtctg aggccttcgg gagctttccc gaggcagtta gcagaagccg cagcggccgc 12 0 

ccccgcccgt ttcctctgtc cctgggcccg ggagggacca acttggcgtc acgcccctca 180 

gcggtcgcca ctctcttctc tgttgttggg tccgcatcgt attcccggaa tcagacggtg 24 0 

ccccatagat ggccagcttt cccccgaggg tcaacgagaa agagatcgtg agattacgta 3 00 

ctataggtga acttttagct cctgcagctc cttttgacaa gaaatgtggt cgtgaaaatt 36 0 

ggactgttgc ttttgctcca gatggttcat actttgcttg gtcacaagga catcgcacag 42 0 

taaagcttgt tccgtggtcc cagtgccttc agaactttct cttgcatggc accaagaatg 480 

ttaccaattc aagcagttta agattgccaa gacaaaatag tgatggtggt cagaaaaata 540 

agcctcgtga acatattata gactgtggag atatagtctg gagtcttgct tttgggtcat 600 

cagttccaga aaaacagagt cgctgtgtaa atatagaatg gcatcgcttc agatttggac 660 

aagatcagct acttcttgct acagggttga acaatgggcg tafccaaaata tgggatgtat 72 0 

atacaggaaa actcctcctt aacttggtag atcatactga agtggtcaga gatttaactt 78 0 

ttgctccaga tggaagcttg atcctggtgt cagcttcaag agacaaaact ctcagagtat 84 0 

gggacctgaa agatgatgga aacatgatga aagtattgag ggggcatcag aattgggtgt 900 

acagctgtgc attctctcct gactcttcta tgctgtgttc agtcggagcc agtaaagcag 960 

ttttcctttg gaatatggat aaatacacca tgatacggaa actagaagga catcaccatg 1020 

atgtggtagc ttgtgacttt tctcctgatg gagcattact ggctactgca tcttatgata 1080 

ctcgagtata tatctgggat ccacataatg gagacattct gatggaattt gggcacctgt 1140 

ttcccccacc tactccaata tttgctggag gagcaaatga ccggtgggta cgatctgtat 1200 

cttttagcca tgatggactg catgttgcaa gccttgctga tgataaaatg gtgaggttct 1260 

ggagaattga tgaggattat ccagtgcaag ttgcaccttt gagcaatggt ctttgctgtg 132 0 

ccttctctac tgatggcagt gttttagctg ctgggacaca tgacggaagt gtgtattttt 13 80 

gggccactcc acggcaggtc cctagcctgc aacatttatg tcgcatgtca atccgaagag 1440 

tgatgcccac ccaagaagtt caggagctgc cgattccttc caagcttttg gagtttctct 1500 

cgtatcgtat ttagaagatt ctgccttccc tagtagtagg gactgacaga atacacttaa 1560 

cacaaacctc aagctttact gacttcaatt atctgttttt aaagacgtag aagatttatt 1620 

taatttgata tgttcttgta ctgcattttg atcagttgag cttttaaaat attatttata 1680 

gacaatagaa gtatttctga acatatcaaa tataaatttt tttaaagatc taactgtgaa 1740 

aacatacata cctgtacata tttagatata agctgctata tgttgaatgg acccttttgc 1800 

ttttctgatt tttagttctg acatgtatat attgcttcag tagagccaca atatgtatct 1860 

ttgctgtaaa gtgcaaggaa attttaaatt ctgggacact gagttagatg gtaaatactg 1920 

acttacgaaa gttgaattgg gtgaggcggg caaatcacct gaggtcagca gtttgagact 1980 

agcctggcaa acatgatgaa accctgtctc tactaaaaat acaaaaaaaa aaaaaattag 2040 

ccaggcgtgg tggtgcacac ctgtagtcct agctacttgg gaggctgagg caggagaatt 2100 

gcttgaaccc aggaggtgga ggttgcagta agccaagatc acaccactgc actccaacct 2160 

ggacaacaga gcgagactcc atctcaaaaa aaaaataaaa agg 2203 



<210> 15 

<211> 1622 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte clone 3484619 
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<400> 15 

ccgcatggag ccccgggcgg cggacggctg 
gcggacccct gtgcacgagg cagcccagcg 
cgagagcggc gcctgcgtga accaggtcac 
cagtctgcag ggccaggcgc ggtgtgtgca 
tgctcgcaac atcgacggca gcaccccgct 
gtgtgtgaag ctcttgctgt cctacggggc 
ccccctgcac gaggcctgca tgagcgggag 
cggggccaat ctggaagcgc acgattgcca 
ccgggagcat ctggactgtg tcaaagtgct 
aaagcttcat gagactgccc ttcaccacgc 
gatgcttatc gagtttggcg gcaacatcta 
tgactacacg tggagcagca gcgctcccgc 
tctgactctg tcacagctct gcagggtgaa 
ggagaagatt gccaagttaa acatcccgcc 
aattgcaggt ggggtccgga ccgtgactgc 
ggctctgcct gttcctctga agcagcgtga 
tcccttcctg cgatcctgtt taggcttctc 
ttgggaaggc ctgcgtctca ggtcacagtt 
cctaccctct caatgggcat gggcccaacc 
caaaggcact ctcaggacag acggcgtggg 
ctgagggcat tgttgctgat tctcactcac 
caggtgcagg ccaccacctc cgggtcggca 
ggctgggccc acggaaggtc ggcctgggat 
tgtgtggatg cttttcctgg agcactttcc 
atctgtcact cagggacttt ctatgggtct 
cacaagtagc actgagtttc ttaggaaatt 
aaaaagttta tatgtaggat aaaaaccttt 
tt 



cttcctgggc gacgtgggtt tctgggtgga 6 0 
gggtgagagc ctgcagctgc aacagctgat 120 
cgtggact cc atcacgcccc tgcacgcagc 180 
gctgctgctg gcggctgggg cccaggtgga 24 0 
ctgcgatgcc tgcgcctcgg gcagcatcga 300 
caaggtcaac cctcccctgt acacagcgtc 360 
ttccgaatgt gtgaggcttc ttattgacgt 42 0 
ttttgggacc cctctgcacg ttgcctgtgc 480 
gctcaatgca ggggccaacg tgaatgcggc 540 
ggccaaggtc aagaatgttg acctcatcga 600 
cgcccgggac aaccgcggga agaagccgtc 660 
caagtgcttc gagtactacg aaaagacacc 720 
cttgaggaag gccactggcg tccgagggct 780 
ccggctcatt gattacctct cctacaactg 840 
ccccgttgtg cccagcattg cccgggtgag 900 
ttgctgtaga tagaacaacg ctccttcgag 96 0 
tcctggatcc tggataatgt ttccagggtg 1020 
gtgggtgtgg ccctgcgctg ttctacagaa 10 80 
atccagtttt cctcttttac ggaccatcct 1140 
gagcacagag gaggctggca gagctgggga 1200 
cggggcagcc tgccgcagat gcacaggccc 1260 

ccaggactgc cctcggtgct catagggaat 1320 

gtggcctggg actgctgctc tgctggctgc 1380 

aaggcatccc ccagccccaa gcctgcgcgc 1440 

ttgtggggga aggccctggc tttgtattcc 1500 

tgtcttcagt attaagtctc caactcttgt 156 0 

tagaggacac gtaggcggta ccactaaggt 1620 

1622 



<210> 16 

<211> 1385 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 

<223> Incyte clone 1275743 

<400> 16 

cgcaacgggc gcaggtgcgg ggcgcgggct 
ttcggcaggc ggccggcgcg gggcgcaggc 
gtgaagagct acgactacct gctcaagttc 
ggcgagatcc tggagagcct gcaggacggc 
gggatcgact acaagaccac caccatcctg 
tgggacacgt cgggccaggg ccggttctgc 
caggggatcc tcttggtgta tgacatcacc 
tggatcaagg agatcgatga gcatgcaccc 
ctgcacctgg ccttcaagcg gcaggtcccg 
aactgcatga ccttctttga ggtcagcccc 
acggagctat cccgcatcgt gctcatgcgg 
cgagtgttca gcctgcagga cctctgctgc 
ctcatcgaca agcttccact gcccgtcacc 
gccaacggca tgaacgcggt catgatgcac 
gggggcggcg gcagcaaggg caacagcctc 
agcccccccc agaactgctc gcggagtaac 



ctctcacgcc gcggcctcac ccggcggtgc 6 0 
ggcgcggcca tgggctcgca gggcagtccg 120 
ctgctggtgg gcgacagcga cgtgggcaag 180 
gcggcagagt ccccgtacgc ctacagtaac 240 
ctggacggcc ggcgcgtgaa gctggagctc 300 
accatcttca ggtcctactc caggggcgct 360 
aaccgctggt cctttgacgg catcgaccgc 420 
ggagtccccc ggatcttggt tggaaaccgg 48 0 
acggagcagg cccgcgcgta cgcagagaag 540 
ctgtgcaact tcaacgtcat cgagtccttc 60 0 
cacggcatgg agaagatctg gaggcccaac 660 
cgggccatcg tctcctgcac ccccgt-gcac 72 0 
atcaagagcc acctcaagtc cttctcgatg 780 
ggccgttcct actccctggc cagcggggcc 840 
aagaggtcca agtccatccg tccaccccag 900 
tgcaagatct cctagcgggg atgggcgggg 960 
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ccgcctgtgc agatgccagg agggctcgag 
tgccgcctac gtggagactg tccacacagc 
cfcgagccggg tgcgaggagg agcatgcacg 
gcggctgggc tgctggtgct tccgggaatc 
tgcctgggct tgaccggcgg ggagcctggt 
cacttttttg tacattttta aggggccttc 
gcactggtga cttcatggcc acgccagctg 
gactc 



ctggacactc ctggctggac gccaggccag 102 0 
tgcctcagaa gcgccgggct ttcctcacac 1080 
gaccaagcgc ggcaggcggg aggagggggc 1140 
ttggtcggaa acaagccggg cctccccagc 1200 
tggcctttct tatttatata gagaacactt 126 0 
agggaagcct gggtgtggcc cggtggtggt 1320 
cggggacgca cttgggactc ctcgagaggg 1380 

1385 



<210> 17 

<211> 2790 

<212> UNA 

<213> Homo sapiens 

<220> 

<221> Tnisc_f eature 

<223> Incyte clone 1722533 



<400> 17 

cccggaggaa cggaaggcag gattgcagct tcctcagtgc aacctccaaa caggaaatct 60 
gagatgtata acatcttata gtttggcttg tcaacgttgg tcatgcggtg gccccaaaat 12 0 
aaactccctg cttcaaagga cagcgtttca gaactgcctg gcagagcagc cagaagcttg 18 0 
gggccagggc agaaggaaaa ctcggggagc atgttctgaa ttaagacact ttcaagaaaa 240 
tcctttgtat tacccctgaa ttgtaccctt gtttcagagc ctaacagggt tttctgattt 300 
gctgttccct cctccccact gggtgtctgt tctggaggcc agggtgagag gtggaggagg 360 
atggccacgc agatcagcac tcggggcagc cagtgtacca ttgggcagga ggagtacagc 42 0 
ctgtacagca gcctgagcga ggatgaactg gtgcagatgg ccatcgagca gagcctagcg 480 
gacaagacaa ggggcccaac cactgctgag gccaccgcgt ctgcatgtac caaccgccaa 540 
cctgcccatt tctacccatg gaccaggtcc actgcacctc ctgagagttc gccggcccgg 600 
gccccaatgg gcttgttcca aggggtcatg cagaaataca gcagcagctt gttcaagacc 66 0 
tcccagctgg cgcctgcgga ccccttgata aaggccatca aggatggcga tgaagaggcc 72 0 
ttgaagacca tgatcaagga agggaagaat ctcgcagagc ccaacaagga gggctggctg 78 0 
ccgctgcacg aggccgcata ctatggccag gtgggctgcc tgaaagtcct gcagcgagcg 840 
tacccaggga ccatcgacca gcgcaccctg caggaggaaa cagccgttta cttggcaacg 900 
tgcaggggcc acctggactg tctcctgtca ctgctccaag caggggcaga gccggacatc 960 
tccaacaaat cccgagagac accgctctac aaagcctgtg agcgcaagaa cgcggaggcc 1020 
gtgaagattc tggtgcagca caacgcagac accaaccacc gctgcaaccg cggctggacc 1080 
gctctgcacg agtctgtgtc tcgcaatgac ctggaggtca tgcagatcct ggtgagcgga 1140 
ggagccaagg tggaatccaa gaacgcctac ggcatcaccc ccttgttcgt ggccgcccag 1200 
agtggacagt tggaggcctt gaggttctta gccaagtacg gtgctgacat caacacgcag 126 0 
gccagcgaca acgcgtctgc cctctacgag gcctgcaaga atgagcatga ggaggtggtg 1320 
gagtttctgc tgtcacaggg tgccgacgcc aacaagacca acaaggacgg cttgctcccg 1380 
ctgcacatcg cctccaagaa gggcaactac aggatcgtgc agatgctgct gccggtgacc 1440 
agccgcacgc gcatacgccg tagcggcgtc agtccgctgc acctggcggc cgagcgcaac 1500 
cacgacgagg tgctggaggc gctgctgagc gcgcgcttcg acgtgaacac gccgctggcg 156 0 
cccgaacgcg cgcgcctcta cgaagaccgg cgcacgtccg cgctgtactt cgcggtggtc 162 0 
aacaacaacg tgtacgccac cgagctgctg ctgcaacacg gcgccgaccc caaccgcgac 16 80 
gtcatcagcc ccttgctcgt ggccatccgc cacggctgcc tgcgcacaat gcagctgctg 174 0 
ctggaccacg gcgcgaacat cgacgcctat atcgccacgc accccaccgc cttccccgcc 18 00 
accatcatgt tcgccatgaa gtgcctgtcg ctgctcaagt tcctcatgga cctgggctgc 186 0 
gacggcgagc cctgcttctc atgcctctac ggcaacggcc cgcacccgcc ggccccgcag 192 0 
ccctccagca ggttcaacga cgcgcccgcg gccgacaagg agcccagcgt ggtgcagttc 1980 
tgtgagttcg tatctgcccc agaggtgagc cgctgggcgg ggcccatcat cgatgtcctc 2040 
ctggactacg tgggcaacgt gcagctctgc tcgcggctga aggaacacat cgacagcttt 2100 
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gaggactggg ccgtcatcaa ggagaaggca gaacctccaa gacctctggc tcacctttgc 2160 
cgactgcggg ttcgaaaggc cattgggaaa taccgtataa aactcctaga caccttgccg 222 0 
ctcccaggca ggctgattag atacctgaaa tacgagaaca cccagtaact ggggccacgg 2280 
ggagagagga gtagcccctc agactcttct tactaagtct caggacgtcg gtgttcccaa 2340 
cfcccaagggg acctggtgac agacgaggct gcaggctgcc tccctctcag cctggacagc 2400 
taccaggatc tcactgggtc tcagggccca gagctttggc cagagcagag aacagaatgt 2460 
gtcaaggaga agaatcattt gtttacaaac tgatgagcag atcccagacc ttctctacct 2520 
tcaggaatgg cagaaacctc tattcctggg gccagggcag agcttgaggt gttctgggga 2580 
aggtggtgct cagagccttc cctgtgcccc tccacttgtt ctggaaaact caccacttga 2640 
cttcagagct ttctctccaa agactaagat gaagacgtgg cccaaggtag ggggtagggg 2700 
gagcctgggt cttggagggc tttgttaagt attaatataa taaatgttac acatgtgaca 2760 
cctgcccagt gaaaaactaa aaaaaaaaaa 2790 



<210> 18 
<211> 2263 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte clone 1759763 

<400> 18 

cggaccgtcc gcctacgggg gccgggacgt 
gcagcatcgc ggtggggatc gaaagcgggg 
ctcggaccag ccatttcggt gtagaagtgg 
tttacagagg cttacgcgga cacgtgctct 
gttaaagtct taaggaaact gctcaaaaag 
ggatggatgc caattcatga agcagcttat 
attaatgcag attcatctga aaactacatt 
ttgcatctcg ctgcaagtca aggacattgg 
gcagatccta atgcaactac tttagaagaa 
ggacagatag atgtgttaag gctzgttgctt 
tctatgtgtg gatggaactc cttgcaccag 
aaattgcttc ttagaaaagg agcaaacaag 
ttatttgtgg ctgctcagta tggcaagcta 
gcaaatgtca attgtcaagc cttggacaaa 
ggacacacaa aatgtgtgga gcttttgctc 
aatgaggaca gttggcagtt acctattcat 
ttggacttgt taataccact tactaaccgg 
cctgtttact cagcagtgtt tgggggacat 
ggctacagcc cagacgccca ggcgtgcctt 
gctttccaaa aggactgtga gttctttgga 
cagataaatg aacttcattt ggcatactgc 
tactttttga ggaaaggttg ctcattggga 
catgcaatta aagcacaagc aaaatataag 
tttgacccac tgattctact gtgcaattct 
atcttcactt tggagtttac taattggaag 
tctgctcgtg cctcaaacgc ttggattcta 
acccatcttt gtcgtttgga aattcggtcc 
agttatatta gtcagctgcc acttcccaga 
gttctgagga tgtatgaagt tccagaactg 
actacttaac acagcfcaatt ttttfcctctg 
tacaagtttt tatgatttta tagtcaaaag 
ttggggggcc agtagttcag tgagaatgtt: 
aaaaaaaaaa aaaaaaaaaa aattgtaaac 



cgcctgcgcg tctctcgttt tcggacggct 60 
gcttctggga cgcagctctg gagacgcggc 12 0 
cagcacggca gactggtcaa acaaatggat 180 
acagttggac ttgctgccag ggaaggcaat 240 
ggccgaagtg tcgatgttgc tgataacagg 300 
cacaactctg tagaatgttt gcaaatgtta 360 
aagatgaaga cctttgaagg tttctgtgct 420 
aaaatcgtac agattctttt agaagctggg 480 
acgacaccat tgtttttagc tgttgaaaat 540 
caacacggag caaatgttaa tggatcccat 600 
gcttcttttc aggaaaatgc tgagatcata 660 
gaatgccagg atgactttgg aatcacacct 720 
gaaagcttga gcatacttat ttcatcaggt 780 
gctacaccct tgttcattgc tgctcaagag 840 
tccagtgggg cagatcctga tctttactgt 900 
gcagctgcac aaatgggcca tacaaaaatc 96 0 
gcctgtgaca ctgggctaaa caaagtaagc 1020 
gaagattgcc tagaaatatt actccggaat 1080 
gtttttggat tcagttctcc tgtgtgcatg 114 0 
attgtgaaca ttcttttgaa atatggagcc 1200 
ctgaagtacg agaagttttc gatatttcgc 1260 
ccatggaacc atatatatga atttgtaaat 132 0 
gagtggttgc cacatcttct ggttgctgga 1380 
tggattgact cagtcagcat tgacaccctt 1440 
acacttgcac cagctgttga aaggatgctc 1500 
cagcaacata ttgccactgt tccatccctg 1560 
agtctaaaat cagaacgtct acggtctgac 1620 
agcctacata attatttgct ctatgaagac 1680 
gcagctattc aagatggata aatcagtgaa 1740 
aaaaatcatc gagacaaaag agccacagag 1800 
atgattattg attgtcagat aggttaggtt 1860 
tatgtttaca actagccttc ccagtaaaaa 1920 
atcacttata ttactttatt gcagcttcat 1980 
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caccagtaca ttatatgttg taatatttat: 
tattttgcta ataaactgtg atgttacttc 
gattcgtgtt tacctggtat taggagctca 
gtatacaact gtggtgtaga atctgttatt 
tatctcttat gtttattaaa gaacatgttt 



ttacctgatc attttgatca ttttctgctit 204 0 
tagtgctaaa catggcatat ttccacctat 2100 
gaatggaatg cataaagctt cactggaagt 216 0 
attatcatta ttattttatt tagacttgac 2220 
tcctaaaaaa aaa 2263 
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